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THE SUPERPOSITION OF SEVERAL STRICTLY PERIODIC 
SEQUENCES OF EVENTS 


Ву D. В. COX anv W. L. SMITH 
Statistical Laboratory, University of Cambridge 


1. INTRODUCTION 


Suppose that there are a number of sowrces at each of which events occur from time to time. 
Suppose further that the outputs of the various sources are combined into one pooled output. 
We shall consider the problem of deducing the statistical properties of the pooled output 
from the statistical properties of the separate outputs. By a statistical property we mean, 
for example, the frequency distribution of the time interval between successive events. 
The problem is illustrated for three sources in Fig. 1. The events are marked by short 
vertical lines and the outputs of the sources S, 5, and S, are combined into a single pooled 
output P. 


Fig. l. The pooling of outputs. 


If we denote by % the time at which the jth event occurs оп the ith Source, we can state 
the problem formally as follows. Given N increasing sequences 5; = (59,9, sede = 1,..., М) 
let the {@ be rearranged into a single increasing sequence P. The statistical properties of 
the 8; are assumed known and we wish to determine various properties of the pooled 
output P. 

There are two cases of special interest. First, the intervals between successive events on 
any one source may be independent identically distributed random variables, the random 
variables associated with different sources also being independent. This situation is closely 
connected with renewal theory and will be discussed in a separate paper (Cox & Smith, 
1953). The second case, to which the present paper is devoted, arises when the events on 
any one source occur at exactly regular intervals so that the sequence 45; is [0;, 20,,...], 
where 0, is the period of the ith source. 

At first sight this is not а statistical problem at all, because the times at which events 
occur are completely determined and no probability or chance mechanism enters the system. 
That statistical concepts are in fact relevant follows from a well-known theorem in the 
theory of numbers due to H. Weyl. This states, in effect, that if the 0; are mutually irrational, 
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2 Superposition of several strictly periodic sequences of events 
the sources are random with respect to one another if observed after 
We make this remark precise in $ 2. 

Two applications, one in experimental 
will now be described. 


Conrad (1951) has investigated behaviour in situations calling for constant vigilance and 
for adaptive reactions to a changing pattern of events. He used 
of which a pointer revolved at constant speed, 
pointer passed a mark on its dial the operator was required to turn a knob. Conrad found 


how the number of responses that the operator omitted to make 
depended on the number and speed of the di 


a sufficiently long time. 


psychology and the other in neuro-physiology, 


a number of dials, on each 
different for different dials. Every time a 


and the error of timing 
als. Here the sequence of events on any one 
equence of instants at which response is called 
operator is likely to depend both on the mean 
nber of times events occur very close together. 


quency distribution of the interval between 
successive events in the pooled output.* 


One neurological application arose in the recent work by Fatt 


taneous subthreshold activity at motor-nerve endings. Т 
branching nerve endings there are a large number of 
disturbances into muscle fibre. Fatt 

the time interval between successive 
theoretical exponential distribution 

explicitly pointed out that the succes 
be periodic. The Statistic 


& Katz (1952) on spon- 
hey found that in certain many- 


active spots each passing electrical 
& Katz showed that the frequency distribution of 


disturbances in the muscle fibre agreed with the 
characteristic of a random series, However, they 


a single spot might nevertheless 
1 as possible about the individual 
ations on the pooled output. Another 
s and the events are discrete nerve pulses 
ant to the theory of neural nets 
older & Landahl, 1945). 
2. FREQUENCY DISTRIBUTION OF INTERVALS 1 
In this section we show how Weyl’s theorem (Weyl, 1916)t can be used to derive the fre- 
quency distribution of the interval between Successive events in the pooled output. It 
be assumed that the periods 0; are positiv. 


€ numbers, and are mutually irrational in the 
gative integers т, 


N THE POOLED OUTPUT 


2 not all zero, such that 
N ” 
È 0; = 0. (1) 
= 
Іп particular, the 0; ave all different and 0,0, is irrati 
sources are numbered so that 9. ; 410; is irrational for all 


precisely, if 11 i 
of (0), (20), ..., {m0} fallin 
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In fact, all the frequency distributions considered in this paper are defined in an analogous 
way by limiting frequencies in a precisely defined infinite sequence. The successive members 
of the sequence are in no sense independent or random, and hence we have refrained from 
using the term probability. 

'To reformulate (2) in terms of sources, consider any two sources, say the first two. 
Associate with the rth event on the first source a quantity 2, equal to the time between that 
event and the immediately preceding event on the second source (see Fig. 2). We have 


х, = 0.470,10,). (3) 
It follows from Weyl’s theorem that, since 0,/0, is irrational, the sequence (2, is uniformly 
distributed over (0, 02). 
(г-1)% гб, (r+1)0, 


Fig. 2. Definition of 2,. 


The generalized form of Weyl’s theorem states that if o, ..., о are irrational numbers, 
themselves mutually irrational in the sense of (1), then the sequences [nos], ..., Кта] are 
independently uniformly distributed over (0, 1). More precisely, if Z, is any portion of volume 
v of the k-dimensional unit cube, and if p,,(Z,) is the proportion of the m k-plets 


(o4)... (0335... (тад), ..., (mos) 


in J,, then I Pall) — v. (4) 


To apply this to our problem we define for the rth event on, say, the ith source a set of 
(N — 1) quantities af, ..., a(—5, a, ..., a( analogous to 2, in (3). For example, 20 is the 
interval between the rth event on the ith source and the immediately preceding event on 
the lth source. Then (4) shows that the sequences [2], ..., [4—20], [+], ..., [a] are inde- 
pendently uniformly distributed over (0, 04), ..., (0, Ө). 

We can now calculate the frequency distribution of the interval between successive 
events in the pooled output. For let q;(y) be the frequency function of the length of intervals 
ending with an event from source 2. The interval is between (y, y+ dy) if one source, say the 
jth, has its «-value between (у, у + dy) and if all the other sources have their z-values greater 
than y. The frequency of the first of these events is dy/0;, and of the second ДШ қ (0,-/у)/0,. 
Thus, by the independence of the x-values, 

1 (0.—у) 
900) = Ж Oust, “0%. (0<y<Oy). (5) 


Also when i = N there is a point concentration Qy of frequency at y = бу given by 


N-1(6,—0 
Qv = II ( zx м). (6) 
ксі u 
If i-- N there is no point frequency. i Р 
ч 1-2 
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The overall frequency distribution is defined by a frequency function q(y), and by a 
point frequency Q, and is obtained by taking a weighted average of the q;(y). In a long time 
the number of events from the ith source is proportional to 021, so that 


N N 
atv) = X opa] X oe 
i=1 i=1 
tees п 88 
= ни = л (0 & y & 04). (7) 
> 
1=1 0, 
DAS Ls) 
0 0, ; 
and Q2-—— E (8) 
гі 
4140; 


We shall write (7) and (8) іп two forms, one suitable for exact computation when ЈУ is 
small and the other suitable for deri ving asymptotic forms when М is large. 
To obtain an expression for computation, we introduce the symmetric functions 


oi? = 0,4+...4+0y, 
о = У 0,0, 


may OL MP 
i> ty 


A (9) 
M= Я б.д, 
eS? = 0,...0y. 
Then 00) =a X > 0 
! од Ш I — V) 
1 N-2 
“ар E Ce noezyo., (0<у<0,), (10) 
N-1 
0—8 
апа = "i V Ox) 
eu m 


The coefficient of y” in (10) isasum of i 
products of different (^s with (> issi 

ча | with (r + 2) mis; ence 
Y p must be a multiple of of... The numerical ве Жік. dus pes п” 

si i e number of ways of permuting the suffices ; j,k њи 
> 8 2, j, k. 
s ea e (11) define the frequency distribution of interval, led 
n s е ribution consists of a discrete frequency Q, at the cdm ий роо d 
When у is email м су curve between 0 and y defined bya demo таи 3 5) 
-— а 1 1а, = 
ia нн 4 0) and 1 1) ате easily computed numerically; som : egres (2) :ven 
55 ym etrie functions, o ean нь > Some examples are give? 
mined multipliers, іі й 


The mean of the distribution ( 

д 10), (11) ean be fi 
ү" there are asymptotically 7071 events и ee rici 
T0; events altogether. Therefore © ith source, 


ky) =| X ial 
ізі 


principles, In a very lon% 
and hence asymptotic? 


(2) 
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The method used to obtain the frequency distribution of the interval between events in 
the pooled output can be extended to give the joint frequency distribution of successive 
intervals. 

3. ASYMPTOTIC PROPERTIES 
In this section we first show that if the 0, satisfy some weak conditions, the distribution (7) 
tends to the exponential distribution as N tends to infinity. 

Let us set y; = бг! and express у as а multiple of its mean by writing 


N 
г=у У Хе (13) 
i-1 


The frequency function, r(z). of z is then from (7) 


хп (1- 
ңа) = НЫ 
2) = > 
- 420) 
с dæ?’ n 
N zy; 
where plz) = П ( - 3) 3 (15) 
е i=l = Ху 
To discuss the limiting form of (15) we introduce the moments of the finite population 
" j > 
ХӘЛ. Put в = (Бхо] Х, | - 
“= (ХІ (5=2,3,...). 
У zX; 
Then logp(z) = X log(1 —= ) 
= zx 
2o, BA P ou tem 


SNe ЗАЗ UC 

To justify a limiting operation on (17) various assumptions about the у, are possible. 
They all have the effect of ensuring that no small group of periods is comparable with the 
mean interval (12) as N tends to infinity. The strongest assumption is: 

(ia) As N tends to infinity the y; have both an upper bound and а non-zero lower bound. 

This implies: 

(ib) As У tends to infinity the dimensionless moments (ШЕ ате bounded. 

A weaker assumption still is: 


(іс) Аз N tends to infinity руји“ < A*N*-? for some A, s = 2,3,.... 
Under any of (ia), (ib) or (ic) we can let N tend to infinity in (17) and obtain 


р(г) ~ e^, (18) 
and ка) = фи €) ens, (19) 


The differentiation of (18) is justified because p(z) for all finite N and the limit function 
€? are integral functions. If we retain the term in N^, we get under assumption (ia) or 


(ib) the second approximation 02) (224 > 
де ер- )(22- 2], (20) 


2N 


where C? = 3/02 — 1 is the squared coefficient of variation of the y;. 
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i Е 2< ух!% [ - at the 
The continuous frequency curve, 7(z), has range 0 € z < yy! У; We shall assume that t 


smallest period 0, is large compared with the mean interval, i.e. that 


у N 21 ) 
(ii) lim хм У y; = oo, (2 
No i-1 
and it can then be shown that the point frequency (8) tends to zero. Thus the continuous 
frequency curve accounts asymptotically for the whole of the frequency. 
We have shown that under assumptions (i) and (ii) 


interval between successive events on the pooled output tends to the exponential curve. 


It can be shown similarly that successive intervals tend to be distributed in independent 
exponential distributions. These results are but one aspect of what may be called the local 
randomness of the pooled output. As another illustration of local r. 
that, if #18 small compared with all the periods 0 
t follows a Poisson distribution of mean ((Xy;) 
events in time / is defined by taking 
over a very long time, and finding 
More precisely, the intervals shou 
mutually irrational with Op зебу; 

Consider first a single source. A time interval 
event, and the proportion о 
function of the frequencies і 


the frequency distribution of the 


andomness we now prove 
i» the number of events observed in time 
. The frequency distribution of the number of 
a large number of intervals of length / spread uniformly 
the proportion of intervals containing 0, 1, 2, ... events. 


ld be of the form (тос, пи +) (п = 1,2, ...), where а is 


1 «0; will include either no event or one 


f intervals including one event is 40;. Thus the generating 
8 


(1–0,) + &/0, = 1—(1—) tx; 
It follows from the inde 


pendence of the sources that the generating function for the pooled 
output is 


N 
GE) = ПП-й-004 


80 that log G(£) = 2 log П-(1-0 


is small for all $, 


“ха. 
If we assume that ty; 


N 
log G(¢) ~ à -t0-0, 


HS)  exp[- t1 — 0) Ey, 


are à number о 
nding only on its 
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4. THE VARIANCE-TIME CURVE 


The pooled output is distinguished from a random series by its behaviour over lengths of 
time comparable with the individual periods 0;. The most convenient way of expressing 
this behaviour is by the variance-time curve, V(t). This is defined as the variance of the 
number of events occurring in a time /. considered as a function of t. For a random series 


V(t) 2 At, (23) 
where A~! is the mean interval between successive events. The analogue of V(t) for con- 
tinuous stochastic processes has been extensively used, for example in studies of sampling 
and in work on irregularity in textile processing. 

To find V(t) for the pooled output of periodic sources, consider first a single source. Let 
Nit = n;+/;, where n; is an integer and 0< 2; < 1, i.e. 
B; = {х4}. 
Then an interval of length £ contains either n; or n;+1 events from this source and the 


limiting frequency of intervals containing n; + 1 events is /;. The variance of the two-point 
distribution is J,(1— //;). Since the different sources are, by Weyl’s theorem, independent, 


we have for N sources N 

Vit) = P BA В). (24) 
= 

More generally the generating function of the frequency distribution of the number of 


events in an interval t is 


N 
GE) = TI &"[1— 7; 2,01. (25) 
ici 
The general behaviour of (24) can be seen as follows. If ¢ is very small compared with 
hh Зе Bi — B) Pi = tyi 
во that V(t)~tdy; = ta, (26) 


where А-1 is the mean interval between successive events. Further 


6(1-00<4%% 


во that V(t) «tA. (27) 
For t large compared with 0;, BAL-B) <tr 
so that V(t) «tA. 


Finally, аз 1 increases, В; takes each value between 0 and 1 equally often giving //;(1— f) 
an average value of +. Thus for large t, V(!) oscillates about an average of $N. 

То sum up, V (t) is tangential at t = 0 to the straight line (23) representing a random series 
with the same mean number of events per unit time. V(t) falls below (23) as soon as #18 
comparable with an appreciable number of the periods 0,, and finally as ¢ tends to infinity 
V(t) oscillates about 4N. Some numerical examples are given in 55. 

We have assumed above that ЈУ is finite and this is certainly true in the applications we 
have in mind. We now give a very brief discussion of the behaviour when ЈУ is infinite. If 
Ly; is divergent the mean interval between successive events is zero so that time intervals 
can be found containing infinitely many events. If we exclude this possibility Ху; must 
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converge. By the argument for the finite case, V(t) oscillates for large t about the curve 
zN (t), where (4) is the number of periods less than 1. As # tends to infinity, V(t) tends to 
Тайпбу but the rate of increase of V(t) tends to zero by the convergence of Xy 


Ps 
Air 


5. NUMERICAL EXAMPLES 


Three special cases with N small have been investigated numerically. They are 


Example А: N = 4, Periods 1, 1-1317, 1-2781, 1-3486. 

Example B: N = 4, Periods 1, 1-1317, 3-2781, 6-8914., 

Example C: N = 10, Periods 1, 1-1327, 1-3819, 1-4662, 1:6173, 
2-2203, 2-6421. 

Fig. 3 shows the frequency distribution of the interv 

(a) the exact polynomial (10); 

(b) the exponential curve of the same mean; 

(c) the second approximation (20). 

Fig. 4 shows the variance-time 
Example C has approxim 
to depict all these. 

The following conclusions may 

(i) The second approximation (20) works well even when N i 

(1) For a given number of periods, the dist 
nential distribution when the periods 0, 

(iii) The distribution of intery 
if the 0, are close together. 

(iv) The variance-time curves h 
for a random series, 

For computational conve 
regard them аз close appro 
multiple of the 0; is extrem 
be extremely good approxi 


1-6593, 1.7814, 1-8181, 


al between successive events giving 


curves computed from equation (25). 


The curve for 
ately thirty cusps in the range shown; no 


attempt has been made 
be drawn: 


s quite small, 
ribution of intervals is closest to an expo- 
are nearly equal. 


als tends rapidly to the exponential form as N increases, 


ave a characteristic form entirely different from that 
nience the 0; could not be mutu 
ximations to irrationals or 
ely large, the results deriv: 
mations to the behaviour 


ally irrational. We may either 
argue that since the lowest common 


ed on the assumption (1) are likely 60 
for rational 0; 


6. ANALYSIS OF DATA 


аа; Suspected to be the pooled output of a number of 
б analysis са; T 
Ifthe number of Sources is smal] п Бо Suggested 


and if the series available for an 


assign eac} 
еркеш tion of the intervals between successiv® 
Point concentration of frequenc | 


alysis 15 long, it is possible 
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10 Superposition of several strictly periodic sequences of events 


Any method based on the frequency distribution of intervals is clearly insensitive xe 
the frequency curve is very nearly exponential except when N is small. A much bet 
procedure is to use the variance-time curve. To determine this from 
data, the following method may be used. First divide the series int 
such that only a small number of events occur in each interval. Х 
events occurring in each interval, so deriving 
the number of events in ( 


a set of experimental 
о intervals of length 7 
ext count the number of 
à discrete time series [а 


1—1)т<1<1т. Add the as together in blocks of r, giving 


UP wwe untae | 
Up = v... Жап (28) 
UE нд = Tm-r+1 +...+х@, 


т" 


The Us are the numbers of events occurring 
of V(rr) may be formed from the correc 
and take as the estimate 


in intervals of length тт so that an estimate 
ted sum of squares of the Us, Put M = т—7+1 


A 3M ~ ә 
Бе | = 'OIT 8 б ау 90) 
Ү(гт) [sar Зе У х corrected sum of squares of (s, ( 


Do this for a number of val 
chosen so that if the series 
constant mean 7A), then 


ues of r and plot Рот) 


against r7, The divisor in (29) has been 
is random (ће, if the 2; 


are independent Poisson variables with 


Elf (rr)] = V(rr) = "тд, 30) 
provided that r < Ут. iR á 
This analysis is closely connected with the serial correlation analysis of the а; (Yule, 1945). 
The variance of V(rr) can be shown by tedious but elementar 
series, 


У methods to be, for a random 


TTÀ TTÀ 

ЗМ (ТА 87 + 274) 4 33i (T TA + 16727 — 94) та #1874 — 2+1) + ofyn) | 

E у А -— 

m tel оа 54 that if the Series is the pooled output of N periodic sources, V(t) 

dd E t ЗУ for large t. This result enables us both to test for the existence of periods 
mate М. Tt will be shown in another paper (Cox & Smith, 1953) that if the 

© events on any one source have coeffi 


cient of variation C, then 


intervals between th 


V(t) О? for large t, (89 

where A71 is the mean interval h 
etwee | 
of sources сап be estimated from the Y() ew on the pooled output. Both C and the number 
Thus our method оға т? 


(а) arandom series; 


(6) 
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SuMMARY 


Suppose that there ате № sources, that on the ith source events occur at times 0,, 20,,..., and 
that the outputs of the № sources are combined into a single pooled output. Statistical 
properties of the pooled output are investigated. Methods are suggested for distinguishing 
it from a random series and for estimating N from experimental data. Applications are 
indicated to experimental psychology and to neuro-physiology. 


We are grateful to Dr J. Wishart for helpful comments on the draft of the paper. 
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APPROXIMATE CONFIDENCE INTERVALS 


Ву М. S. BARTLETT 
University of Manchester 


1. INTRODUCTION 
Itis fairly generally known that asymptotic confidence intervals for 
0 may be obtained directly from the likelihood (log) derivative 21, 
probability of the sample (see, for example, М. G. Kendall, 1 
are asymptotically equivalent to those obtained from the m 
д, and have the property of providing 
average (see Kendall, $19-12). The direct use of 0/20 has the advant 
variance are (under the usual differentiation conditions, whicl 
up to any order required) known exactly as 0 and 


oL? L 
DORE 
properties of 0, so that the only approximation introduced 
on of 91,00 is concerned is in regarding it as normal. This 


an unknown parameter 
160, where р = exp Lis the 
946, $19.10). Such intervals 
aximum-likelihood estimate 
asymptotically shortest confidence intervals on the 
age that its mean and 


1 will be assumed throughout 


in contrast with the asymptotic 
as far as the sampling distributi 


e value 0, for each critical value To. 
) mality approximation it is quite feasible to investigate also 
the higher moments of 91.100, so that a correction to allow, say, for its skewness is not unduly 
' | a further approximation pro- 

cedure. It would appear difficult to Specify precise and useful conditions under which such 
c à › SAY, is necessarily free from possible 
ements on the relation of T (0) with 0, but one or two incidental 
pect any correcting terms to have 
monotonic relation with 0 in " j samples, and thus not to disturb any 


: ы ы dmissible range (this 
ximum % 
7 likelihood esti - In such cases the fact that We 


Yes’ if 2120, а 
8 merely consi 


M. S. BARTLETT 13 


claimed. It might also be remarked that the approximation procedure is sometimes con- 
venient even if a sufficient statistic does exist, in cases where the exact distribution of 
the sufficient statistic is difficult to handle (e.g. if the observations are from truncated 
distributions). 

Only the case of one unknown parameter 0 will be considered here, but it is hoped to 
discuss the case of more than one unknown in a later paper. The extension to confidence 
regions is in principle straightforward, but the case of one unknown plus ‘nuisance para- 
meters’ is more subtle, being much more dependent on the nature of the sufficiency pro- 
perties of 0/00 (see earlier footnote; cf. also Bartlett, 1936, $6). 


2. Moments ог éL/é0 


For reference a convenient method will be given of expressing the moments of 0.L/00 in 
their simplest terms for evaluation, as an extension of the familiar result 


(в) | - 21-9). 


20 \ 7 202 
It is desirable to introduce a more convenient notation, and we shall write 
Вы] la Es m Le 
admis dean 
fy ane. LB ata 
кыно bs рез? - [res |" m ir mt «de 


where ІШ is a formal notation for the probability Stieltjes integral over all possible 


samples. Expanding the last expression in powers of 7, and equating coefficients, we obtain 
an infinite series of relations of which the first four are 
14 = 0, 
L,+L = 0, 
Та +3(L,L2) +19 = 0, 
Ly+BLQ-+4(L, Го) + 6(ТАЭТЬ) + 140 = 0. 


(1) 


The first two are the familiar ones already quoted, and the next two involve the third and 
fourth moments 7,9 апа 79. These are not yet in their simplest terms; thus it is possible 
to express L{® in ‘linear’ terms alone. We may, however, obtain an unlimited number of 
further relations by differentiating the relations (1). We have by differentiating the second 


and third relations once, noting that 


à aL 
Jp = 5 | 20 (20) = P+ 


“э 


15+ LP + 2(L, Ly) = 0, 
„Ра + (LPL) + 248 + (L L)} + {LP + 30:9) = 0. 
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Differentiating the former of these two derived relations again, we obtain further 
„По (Lf9 + 3(LPL,)} + (LOL) + L9 + (LZ, L4)) = 0. 

We find finally, reverting to the full notation for clarity, 


(22) рүе) „21, opf (2) 
к(а) == (о) 320+ 7B ЈЕ 
aL) _ pf (3A » 
“(%) (57 Ju 
ECCO E) asd (8) 
= 622+ 835 | – 3E ap +30 |а0 |» 


where it will be noticed that K, involves one ‘non-linear’ quantity, the last variance term 
involving the mean square of 221003, Whether these formulae (2) and (3) are useful will of 


course depend on the form of L; for example, in the case of the Cauchy distribution it is 
simpler to evaluate Ks and к, directly. 


3. APPROXIMATE CORRECTION FOR THE SKEWNESS or 22/0 


16 seems most useful to consider polynomial transformations of dL 
transformations may be chosen with the aid of the ађоу 
(and, if necessary, any higher поп. 
in practice that we shall wish to gi 
Consider the theoretical statistic 


дг Г? 4) 
T (0) = - +2 (29) -1] | ( 
which still has zero mean, and variance 


any 
Т+2Аку+ ^о (%) | | 


[90 as in principle such 
e relations to annihilate the skewness 
-zero cumulants) exactly, although it is perhaps unlikely 
о further than a first further approximation involving Ks: 


(9) 
Its skewness is ка + nel (5) j +O(A2), (6) 
As o*((0L/00)2} = ка + 219, we therefore choose, to the first order of approximation, 

и d 
whence ту 50—198 [G5) - г]. (8) 
отн н» ix бшен кё, is still 1 More accurate confidence intervals should 
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with mean zero, variance o? = I + 2v[k,- 372] + O(v?), (11) 
ЈАН 
skewness - E{T3} = ЗВ (55) | + O(v*) 
(20 
= 0--O(92, укр), (12) 
ОТА к 
and kurtosis ЕТ} - За = к, +31- + m (5) | — 804+ O(v"), 
or, after reduction, ка + 24018 + O(v?, ука, VKo). (13) 
1 ky 7 
Hence we choose = 2413 (14) 
ар ік, шр, А 
and то) a og. (15) 
with variance о ~ 1(1 — ka] 1). (16) 


4. EXAMPLE I 


To test out this procedure, we shall try it out on a standard problem whose exact solution is 

well known, namely, confidence limits for a (normal) sample variance. This example is 

taken because the sample variance s? is quite skew for a moderate number of degrees of 

freedom, and there is some interest in seeing how far the method can cope with this skewness. 
ý 


We have a 
SE cr кш 
00 920? 
021, n та 0n 
age = 208 @' Т =з» 
031, т 88° Е ЕЗ 2n 
208  @ 9^ |003) ov 
91 n ета т 
00 0° з= 0 


checking of course with the value inferred from the properties of s?. The 
he confidence interval would amount to solving 


эз 1—9) = en (>), (17) 


where for convenience we have expressed confidence limits 0 in terms of s?, i.e. have put 
s? = 1, The more accurate approximation suggested replaces (17) by the equation 


ga 0-9 АСЕ АЕ (18) 


nger yields а simple quadratic equation for 0, but to the same order 
Я bracket by its value from (17), viz. 20802 — 1)/n, 


the value of ка 
standard approximation for th 


This as it stands no lo 
of approximation we may replace the square 
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and (18) then also reduces to a quadratic (i.e. to a linear equation for each sign of и). A corre- 
sponding simplification is available in general, for the crude interval is obtained from 


an =+ 19) 
20 zu МТ, ( 
and substituting in 7(0) in (9), we obtain 
9b Тк. в 20) 
55—57 (2-1)- +д./1 ( 


as the corresponding modification of the general equation (9).1 
In arithmetical work where 01.190 is calculated for various values of 0, cither (20) or (9) 
may be used. The original approximation (19) is most conveniently plotted as 


я [ЧЇ = и, (21) 


for the values of 0 for which the left-hand side passes any chosen value of x can be read off 
from the same graph. Equation (9) similarly is written 


01, l«s[ (01; а = 99 
a5] Ч1—$11[ Go] V!) EIE (22) 


and may be used in the same way. The equation equivalent to (20), viz. 


oL lk 
a t - Rue - = аш, (23) 


may only be used at the one chosen value of д, but h: 
polation for 0 (or some suitable function 
be so also for (23), rather than for (22). 


as the slight advantage that if inter- 
g(0)) is approximately linear in (21), it is likely to 


Table 1 
" == "E 
Exs5tevalues First or standard Second or further 
approximation approximation 
n 
| 
Lower | Тонер | Upper | Upper yer | 
e pper Lower | Lower} Upper Upper | Lower | Lower | Upper Upper 
0-01 | 005 | 005 | 0-01 | 0:01 | 0-05 | 005 | 001 | oor | 005 | боз | 001 
| 
5 | 0-331 | 0-452 | 437 | ооз | 0.405 | 0490 | со* : 
: i- 9-0: 5 | 0-490 | о co* | 0:327 | 044 “35 | 855 
= MES ena n 3-91 | 0-490 | 0-576 | 3-78 | co* | 0-428 0.541 245 3:94 
80115 nC 84 2-42 | 0-576 | 0-658 | 2-08 378 | 0531 | 0-634 | 1-86 243 
: 162 | 2-01 | 0-625 | 0-709 | 1-74 | 2-50 | 0-589 | 0-684 | 1-63 | 2:01 


* In this example the 


Я approximations 
with the first approxim: T Шалыс 


| ak dow; imit for ЯС" i 
ation. even for some n at the upper limit for small and P; this occur 


of the values in the table. 


To return to (17) and (18), 
limits obtained from the dist 


and 0-01, or total significance limits of 0-1 
and 30 for т. oF 9-10 and 0-02, and repres 


e resulting limits are compared in Table 1 with the correct 
mbution of s*, for upper and lower significance limits Р = 0:05 


entative values 5, 10, 20 


> 
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It will be seen that even the second approximation is not too good at the upper limit for 
” = 5, and would be inferior to at least one other available approximation for the significance 
limits for 52, but the important point to remember is that it is a much more general method, 
and this example suggests that it should be a considerable improvement over the first 
approximation, which is still hardly satisfactory іп this example for n as large as 30. 


5. ExaAMPLE П 


As & second example we shall examine a standard discrete distribution, choosing for 
Simplicity the Poisson rather than the binomial. Here we easily find, for 0 the unknown 


mean and x an observation, OL = ү 1 
@ 6 >“ 9 
en 2 "m 
В] == ope 
Whence the crude approximation gives the equation 
v и 54 
--1-і--, 2 
вее (4) 
and the further approximation (corresponding to (20)) 
x 1 x ш DE 
1-25 (1—1) = +. (25) 
-—— Пе 


Of course for discrete distributions a confidence interval (at least if obtained from the data 
directly without introduction of extra randomization devices) can only be given as an 
Mequality for the confidence level. This is perhaps hardly crucial if an approximate method 


Table 2 
First or standard Second or further 
Exact values* approximation approximation 
а | | 
| 
Lowe ver ‚ | Lower | Lower | Upper | Upper | Lower Lower Upper Upper 
0-01. ae en pig 0-01 | 0:05 | 0:05 | 0-01 | 0-01 | 0:05 | 0:05 | 0-01 
mS | | | 


4:93 
| 6:86 
| 858 


. -37 3 

3: “61 + 3:64 6:37 T 

0-05 pu "En | 0-07 5-28 55 Т 0:02 à 
Б . 43 6-79 7 

tee | 559 1005 0 Т81 10:19 


7:75 | 10:05 0-92 8:22 | 11:33 0-36 0-77 


0 
1 

2 

5 1-9 Ў 1 0:92 | 1004 | 1430 | 1-21 | 1-93 | 10:56 | 13-22 

Ы . „А 3. a E >з Уч 

ал ата IE AES Im | 561 | 17-35 | 2121 | 407 5-40 | 17-00 | 20-23 
20 | 11.08 | 13.25 | 29-00 | 33-10 | 11-58 13-46 | 29-42 | 34-08 | 11-04 | 13-23 | 29-09 | 33-16 | 
30 | 18.74 | 21.50 | 40.60 | 45-40 | 1929 | 21-82 | 41-04 | 46-33 | 18-71 21-58 | 40-71 | 45-45 


Бо uo 
AARRWS 
Oe O 


US breaks down near 0 = 0. (The inadequacy of the secon xi 
©XPected, for the Poisson mean ô in this examp! 
Sample.) 
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isin any case being used, but it reminds us that a still further approximation is ont ~~ 
in obtaining limits from normal theory. For a purely discrete distribution as in » қ н ne 
some gain would be expected from the introduction of the usual ed y corr во z 3 ide 
implies using z— $ when obtaining the lower limit for 0 and «+4 for the upper ми ы a 
correction has therefore been used throughout, though from an examination of one ма pi 
values the consequent gain for the first crude approximation appeared much more doubt 
than for the second. | О | —S 
It will beseen from Table ? that the second approximation is a considerable improv anen | 
over the first (except near 0 = 0), and is in fact somewhat better for small 2 at tig upp 
limit than we might have anticipated from the experience with the first example. rhe pes 
values quoted were readily obtained by Garwood from ү? significance limits, and, as it 
Example I, it must be emphasized that the purpose of examining the present 


approximation 
in these standard examples is merely to obtain an idea of its accuracy. 


6. ExAwPLE III 


The problem which actually initiated this inquiry is being discussed in detail elsewhere, but 
it may be helpful to indicate it here. From the tracks of certain cosmie ray particles which 


‘decay’ into other fundamental particles, it was required to estimate the ‘decay’ parameter 
0 in the lifetime distribution F(t) dt = equo, (26) 


This is simple enough in the case of unlimited tr: 
finite size, not all particles will decay; 
particle because its time of passage thr 
Situation is not always the same as thi 
difficulties, but for simplicity here we s 
effective times Т (s = 1... № 
likelihood function in this 


this causes no difficulty. F 
ticle is Q, = е-7,0 


ack length, but for tracks in a chamber of 
the chance of doing so moreover varies with each 
‘ough the chamber depends on its momentum. The 
в and is often complicated by further observational 
hall suppose we have just № particles of one type, with 
) in the chamber, of which 7, decay at times t, (r = 1... n). The 


case is a mixture of finite probability factors and densities, but 
or such data we have, as the probability of the sth undecayed рат“ 
‚ and the probability density of the rth decayed particle ің S(t) = et [0, 
oL Bt у ҮП 


mp = — р Ed _8 27) 
Фон EG | 
giving in this particular case the immediate estimate 
A 1 n N 
ё (5+ X т) (28) 
т s=n+1 


i ple solution @ is pos bl pined 
ult to show that for equation (27) possible from the com 
A p 
Tz +s 9 
=» (20) 
where P, = 1—Q. Thus the first a 


obtained from equation (19), with 0100 i 
case when all Т become | d I by (29). In the limiting 
t from the exponential di 


В : à > to Example L for ureme? 
and the mean і would bs poca (26) is equivalent to а, Х with two fien јен 
(or 5) with әү degrees of freedom. In practic? 
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the 7 are often small, but in any case, in апу attempt to obtain a confidence interval for 
0, it would seem advisable when possible to go to the further approximation (9) or (20). 
It is not too difficult, at least in the case covered by equation (27), to obtain the exact 
moment-generating function J/(¢) of 0? &L[C0, remembering that the chance of decay some- 
where in the chamber for each particle is (s = 1... №). The answer comes out to be 
N 
М(ф) = П M(Q), with 
s=1 
M49) = e-# [1 9,67: {1 — (1 09) $5] — 9$). (30) 


Expanding log M (à), we find for the cumulants of 6221/20 


кү= 0, =F EB 
N N 

Ky = 203 У D,- 30° У QT (31) 
821 $21 


| 


N N N ДИРИ 
6N01— 803 У; Q,T, — 60° 270,0 х%- 304 x ©. 
s=1 з= = ge 


Ка 


However, as the distribution of 22/20 is a mixture of continuous and discrete components 
(for there is a finite chance that none of ће № particles decays), it would be difficult to make 
use of the exact distribution of 0/00 in this particular case, and the use of the cumulants 
(31), while first obtained directly by the above method, would be equivalent to using the 
arlier in this paper. The formulae in (31) agree of course with 


&eneral method developed e 
, and the expressions for Kg and к, were checked 


the results derived via the general method 
by this means. 

For maximum-likelihood or confidence interv 
9r interpolative methods may usually be used (s 


al equations not directly soluble, iterative 
ee equations (21), (22) and (23), $4). 


Iam indebted to Mrs A. Linnert for assistance with the calculations for Tables 1 and 2. 
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Notes added in proof. (i) The coefficients in the polynomial transformations considered 
In $3 are equivalent to those in a series expansion given by E. A. Cornish & R. A. 
Fisher in §8 of their paper ‘Moments and cumulants in the specification of distributions, 
Rey, Inst. Int. Statis. (1937), 4, which should be consulted if further terms of the 


9Xpangi — 
et ага ecg I have discussed further in my paper ‘On the 


Gi) The pr ferred to in $6 à : 
statistical Шат age ein. life-time,’ Phil. Mag. (7th series), 1953, 44 (in the Press). 
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INCOMPLETE AND ABSOLUTE MOMENTS OF THE MULTIVARIATE 
NORMAL DISTRIBUTION WITH SOME APPLICATIONS 


By A. R. KAMAT, 
University College, London 


1. INTRODUCTION 
Tt is a well-known property of norm 
process of totalling or averaging the 
also, of course, holds if the errors are с 


ally distributed error variables 21,2%, а, ... that the 
error produces a normal distribution. This property 


orrelated ‘normally’. Occasions arise, however, when 
it is necessary to consider the total numerical error 7 = [а |+ | a, |+ |а | +... (or its 


average). Tricomi (1936) has investigated the statistic T as 
the variables are uncorrel. 


at individual errors compound into 
rical values of individual errors. We may 
etal rods it is found that the lengths 
И error variable. Pairs of rods with 
anner requiring the longer of the pair to be filed dow? 
€ cross-section of the rods is constant, the total loss 9 

l to X [а 8 à sum of independent mod- 


21 — t|, which i 
se when special mechanisms give rise to a total erro" 


the diameter of the gauge 


trees Placement of the points 5 
ae Proportional to a+.¢ ly|, where с depends on the pit? 
While dealing with some 8 
pecial cases of с 
was concerned with the evaluat; ы e 


variables, the present apti 
as thereby naturally led to th 
Отта] distribution, namely: 


caleulation of the ‘absolute moments’ of the multivariate n 


++ P(x) ах, (1) 


о 
7*5», these variates having ue 
We shall not be able to deal W! 


nvestigation of this statistic į 
5 Statis 
16-27 below, к, 
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It is clear that the contributions from different quadrants can be obtained from (2) by 
changing the signs of the appropriate elements of the correlation matrix and that (1) can be 
found as the sum of all these contributions. 

Ја $2 of the paper we shall deal with the evaluation of [1, т, n] for the bivariate and the 
trivariate cases and hence with that of (1, т, n). Since the work was putin hand, papers have 
been published by Nabeya (1951, 1952) giving the absolute moments for the bivariate and 
the trivariate cases and a generalization in the form of a multiple integral for the multi- 
variate сазе.* Nabeya’s method is not, however, applicable to the evaluation of [}, m, n], 
and the methods developed in the present paper are also believed to be simpler for evaluating 
certain of the absolute moments (J, т, n). In $3, [l m, n, ...] and (l, m, n, ...) are obtained for 
4 multivariate distribution in the form of power series of correlation coefficients. Finally, 
$4 is concerned with the application of the results to the distribution of 7 and related 
Statistics, 

2. DERIVATION OF THE ABSOLUTE MOMENTS 
In the following, without loss of generality, the variates are measured in terms of their 
Standard deviations, so that the n-variate normal distribution is given by 


e Р 
(а Las ...,2,) = (21) wt exp Ее (4-1,0, Дү”) 


where w is the determinant of the correlation matrix, w,, are its cofactors, and the quadratic 


form Хв,т,ш, is positive definite. 
2.1. Univariate case 
For the univariate case the results are well known and are easily found by direct 
Integration. - - 
ате: = gmear( 27) М (3) 
0 


= 30) = (та | 


The following representation is given only to illustrate the method which will be followed 
1 о 


ји аар o : 
ù the bivariate and trivariate cases. Defining 


ые 2 l m = u^ -adz = 1 
it iş easy + 
y to see that "MU юр, (8) 
: FER. 
Where Py, = [ - уд ‚ Раа = [ = m]. , (6) 
а=1 =. 


Б + 
9 that the familiar results (3) are obtained. 
2.2. Bivariate case 


For the bivariate case © 
[m. n] " (ЕЕ y) da dy, (m, n) = | а" | |у" | p(x, 9) da di > 
F — © 


(a3 +9? —2pxy)?, 


Wh 1 
“У теу) = (20) (1 —р") 3 exP| (1—09) 
* Nabeya for letting us see a manuscript of his 
Васо; Wo gratefully acknowledge the courtesy of Mr 5. Nabey 
i Paper in advance of its publication. 


22 Absolute moments of the multivariate normal distribution 
the substitution z' = [2(1 — p?)]-* x, у’ = [2(1— p?)]-* y transforms (7) into 

[m,n] = 2-4 [2(1—p?) emen P, p | (8) 
and (m,n) = 27 [2(1 =p (E, nU Qua); 


where Pan = Пе” exp {—(x*+y?— 2pay)} ddy, 
0 


and Q,, п is obtained from У by changing p to — р. Defining three basic integrals 


Вы = [| дадага), Бр = [ее аа, вы = Гага, 9 
0 0 0 

where f(w,y) = exp {— (61122 + wy? + 2012) and the quadraticform оа + Way? + 201224 
is positive definite, it is easy to see that Эл,» can be obtained as a higher derivative of Ryo 07 
Ryo or Ro, (as the case may be) with res 


pect to the parameters 01,0; Wyo, With the sub- 
stitution of v, = 05 = 1 and w 


12 = —p after differentiation. Q,, n is obtained from Ра а PY 
simply replacing p by —p. For instance, 


R, 
Р’ = ( -1у a 
os 9 00%, я w= p 


and Ва= ( -1p жете, | 


201 —023—1, (915 — p* 


2-3. Bivariate case: Evaluation of the basic integrals 

It now remains to evaluate the basic integrals Ry, Rio and В. Changing to polar 

co-ordinates and using standard integration, we obtain for Коу the two expressions which 
will be used in the following: 


Ви = 191,05, — оду [ат —sin- (оду 


= HO W — 015) 3 cosi (y2(41 5,)73). 


1 May be evaluated in the sam 


Әп) (10) 
The integrals Ryo, В, | 
will also be used in 


а may be € way or by the following method whol 
e tr t ; i 1 
О ivariate case. Clearly the transformation A03 x = а, 920 
3 ћу = oF osi, (11) 
whe = 
те Ј = ПЕ а (02-02 + Залу) dedy, a= (опо) 019. 0% 
H E 9 
Similarly, М Ry = оті ox Ју 0% 
where 
Jy = fy exp(— (2-е 2axy)} йау. z 
D 
Now J, | 
Ei S | + 
E] | ( ay) exp {— (23 + y24. 2axy)} dedy, 
and introducing v= 


«ау, у’ = У we have 
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Similarly 
imilarly ај, J, = үт. (16) 
Solving (15) and (16) J, = Jy = 4n (1a) 
and substituting for а, Rj) and Ro, are obtained as 
Ry = 440i (vl oh + ев), 
17 
Ry = dum OR (ois оф + ox? ni 
9-4, Trivariate case 
For the trivariate normal distribution the contribution from the positive quadrant is 


given by 


© 


i 1 
l,m, n] = (2п) o || алата exp ЕН 4тубаубжу (7,8 = 1,2,3). (18) 
0 


The transformation a/ = (20) +2, carries (18) into 


ЕД т, n] = пл“ оне, (19) 
о 

where Баш; т = || | ад ay xy exp { > Eo, 2,0.) day ату das. (20) 
0 


Tn the same manner we can show that the absolute moment 


(1, m, m) = q- i2) minm [Da + Phn СЕ Р, т = pe, als 


where Р Onn (6 = 1, 2,3, 4) are obtained from (20) with the signs of ( + о), r+ sin each Pnn 
-pattern shown below: 


taken in accordance with the sign 


$ Wy W23 Cus 
i++ + 
ан = = (21) 
5; = + = 
dh = = = 
As in the bivariate case, it can be shown that Вт,” can be obtained by differentiations 
With respect to w,, from the four basic integrals defined as follows: 


Jim [| f (oi го» т») dv, 42,42» Rio = ез 23) 202303, 
5 : (22) 


Row = ПІР (44, 7» аҙ) da days, Шат Шы (23525, аз) da, баба 
0 
0 


= exp ( -Хо,а,2) (r,s =1,2, 3). 


The four results P9, ,(4-1,2,3, 4) are obtained essentially by the same differentiations, 
the difference being ийт in the insertion after differentiation of the appropriate signs of 


(+ ба), 7+ в in accordance with (21). 


where Ка» аз) 
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2-5. Trivariate case: Evaluation of basic integrals Ry and R, 
(а) Evaluation of Воо. 
The transformation 
Меца = a, Маут, = у, Мозв а = 2 


a 
gives Доу = (nent || Гоби 20у  2ayz + Эрг) dedydz, (28) 


=} 24 
where а = (w303) 10, b= (зи), c= (оц 0з) шуу. (24) 
The transformation x = rsin 0 cos 


P, Y = rcosÜ,z = rsin Osin ф transforms the integral on 
the right-hand side of (23) into 


| E in Г. exp(—7?[1 +2 2(asin ф +b cos ф) sin 0 соѕ0 + 2c 8120316 pcos фт бағ 404$ 
=0 


-14- ЈЕ аф Г П-->(авіпф--б сов Ф) зіп 0 cos 0 + 2c sin? f sin ф cos ф]-ї sin 040. 


The substitution ¢ = tan 0 gives, on integration, 


iz 
foo Hr otia] [| вава ауы. (ра. ayy 4), ~ 
0 
where A= 1--2свіп ф cos ф, В-азіпф--ӛсовд. 
Тһе весопа integral in (25)1 


15 evaluated at once; the first i 
tand+e = tan 9”. After so 


is integrated with the substitution 
me simplification, we obtain 


Roo = фут (0931 055 005,)-* А — У tani e , РА 
a,b,c 
lec b 
where беа 1 als (от 05 0з) үу, "u 
| bal 
W being the determinant 
Од др оу 
Өз 0 0 28) 
{ ©з Өз Wag 
Since а — ђе = НЕ (O30, 79139035), ete., we have finally 
m brane аен zi (29) 
(b) Evaluati ы 
ation, of Rio Row Ron. 
Employing the same и 
= оү! (оз) Js (30) 
where 
z- ffe exp (— (ata. yg 2сау + зада 4. 2bzx)} da dy dz 
а, 0, с having the same meaning ag in (24). Similarly 
Ro = вы (озо) Jy (31) 
Fo = oii (o, o yi (32) 
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where J,, J, ave defined in a manner similar to J,. The substitution z' = x+cy+ bz, у = y, 
2 = 2 gives 


T+ cJ, +В, = Е exp { — [r2 4- (1 — 02) у' + (1— c?) 2? + 2(a — bc) y'z']) ах' ау dz’ 


1 

шеге [[ewt- (y? + 2? + 2ay' =} ау dz' =I, (33) 

7% 
where, from (10), І, = 4(1—a2)-? cos? (а). (84) 
Similarly, cJ, 4- J, t aJ. = p (35) 
bd, + ad, +I, = І, (36) 
where 1, I, are defined as in (27). Solving these linear equations (33), (35) and (36) we have 
J, = А-Ца — a?) I, + (ab — c) y+ (ac — 0) 1), (37) 


where A is defined as in (34) above. Expressing а, b, с in terms of w,,, we have 
= 4 LE (о (gg — 033) 00571 (0 (Өз gg) *) 


Wag Wig — 01033 ____ ауу O23 12-1399 1 = 
2313 — 1233 871 (o), „(уу (033)73) + 08-1 (WhO 959) 3) (38) 
(5301 — оз) (na ( ss) (0110) — ja)? z 


where W is defined as in (28). Roro: Roo: Will have similar expressions. 


2-6. Table of results 

We give below а list of [m, n] and (m, n) for the bivariate case and (7, m, n) for the trivariate 
Case up to weight six. For a fuller list of (l, m,n) the reader is referred to Nabeya (1951, 
1952). Values of [/, m, n] for the trivariate case have not been given here because of their 
lengthy expressions. The following observations may be made: (1) All [m,n] and some 
т, n] can be obtained by direct integration, but the procedure involved is very tedious. 
(2) When the differentiations of the R integrals concerned are with respect to w, (7 = s) only, 
Simplification may be introduced by expressing them as differentiations with respect to 
а, b, с. (3) Absolute moments, when 7, m, n are all even, are the same as ordinary moments 
Of the normal distribution. (4) It is easily seen that (1,0, 0) = (1) and (l,m, 0) = (l, m). (5) We 
May also note that the moments about zero of the distributions 2(2,%) = ce-9 v7? and 
P(X, ta, та) = се-9 ту тет), where Q is a positive definite quadratic form and 21, ж», а > 0, can 


be obtained from formulae essentially similar to [m,n] or [l, m, n]. 


Bivariate results for [l,m]: 


[0,0] = z (4r - sin p); 


[1,0] = + 


[2,0] = = (фар +p ја –РУ 
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[1,1] = 5 fold + sin- p) + (1 — гу, 
01-1 Farre-p), 
ба] = ИРУ, 
[4,0] = gz Gr sint) * A — p?) (5p — 23), 
[8.1] = Fold + sin-1p) e Ја p) (2.155), 


[2,2] = zl +208) (1r -sin p) + 3p /(1 -рЭ) 


, 


1 1 

15,0 = 1 [A Le py (S— 05.1 959), 
1 

№, $ Тарр, 
L 72 

821-2 [520 epp, 


[ 
[6,0] = 05 (37+sin-1p) *J - р?) ( (83р — 26p3 + 8р5)), 


5, умар P) A. — p?) (8+ 9p2— 2p')}, 


[4,2] = $2 -ea +4p?) (E + 871) Р)+/(1-— р?) (13p + 2рз)} 


3,3] = = ~ 
[3,3] = Bel (3 + 2p) (Jr -- sin-1 p) P)+4(1— р?) ) (4+ 11у}, 


Univ 
ariate, bivariate and trivariate absolute moments : 


(1,0,0) = (2, 
п 
(2,0,0) = 1, 


(51,9) = а 


800 = ЈУ, 


7 
(2,1,0) = NEUEM 


pa, ys er m 
(0р i 
3 Різ) віп-і 
Коу E. 12013) Sin: Роза), 


— pis) +рузвіп-1 Piz} 


нь 
2“ Pie) (2 pis) + БН sin o, } 
әр 
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(2, 2, 0) = 1+ 20%, 


2 i 2 2 2 
(2,1,1) = = Pos + 2руәруз) Sin pas + (1 — 083) (1 + pla + pi); 


2 
5,0,0)=8 /=, 
(5,0,0) Je 


o 
(4,1, 0) = + 2 (3+ 6p,- ptg). 


2 
(3, 2,0) = NET 


л 
233 "met 5 
ae С) {(2 + pta + Pia) 9* + 2(раз + ЗО») SIN™ 2234 
| + [Зраз(1 + Piz) +P12P23(3 — Pig)] sin руз. 
+ [8p1o(1 + Pis) + Разрга(3 — 13) sim 7 рә), 
ə А 
(2,2,1) = ТЕ (1 + 2рђ + pla + 3a — Pa P3 + MP1s Pis Pa); 
(6,0,0) = 15, 


(бр, sin Py, + VO — pia) (8 + дрђ— 2013). 


a w? 
—4P12P13P23— 24a — Эр + 413+ ге) , 
23 


(3, 3,0) = 2 (8р3 + 20%) sin ра» + 4/(1 — рї») (4+ 110%), 
{8(0 + 2P12P23 + Эру рз) Sin 213 
4A — pis) (2+ 6р? + рӯз + 2033 — 291533 + бра уз аз)» 
(2,2, 2) = 1+ (0% is + Pia) + 802013022: 
included in (1, 1, 1), (3, 1, 1) and (4, 1, 1), is the determinant of the 
rtial correlation coefficients included in (1, 1, 1) and (3, 1, 1) 


(3,2,1) = 


ЕШ] 


Note. The expression @, 
Correlation matrix, while the ра 


анны Реза = (P23 = pasas A —pis) /(1—рїз)}, eto. 


2-7. Generalization to the multivariate case 

g the method given above may now be stated in a general form. 
n-variate normal distribution can be expressed in terms of the 
derivatives of n +1 basic integrals consisting of (i) В and (ii) n integrals of the type В, of 
that order, The latter satisfy a set of n linear equations among themselves and integrals of 
the type R, of the (n.— 1)th order and hence are determinate ifthe Ry integral of the (n — 1)th 
order is known. For instance, since Roo is known, we can find 1,000; Долоо» ооз, ооо, where 


R a = ехр { ть Хо,т,2;) do, das da da, (ғ, $ = 1,2,8, 4). 
1000 ~~ 
0 


The principle underlyin 
The absolute moments of an 


( 
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distribution 
Thus 000(9:3, 034, Сад), Oy, Ziz 04 | 
log А-а Rooo(%13; 44, 44), l, Gog, X | (39) 
Rio = loj (o) Әз)” А- n 
рен | Pooo(% 145 24, 242), Әз» l, ац 
| Воро(&э› 03, 212), Фу, бы, d 
where , # = [o 


uh а= 01000) 


and Која, b, с) is the three-dimensional integral 


Шы» {— (23 2 4224 avy + Зђуг + 2czx)} dx dy dz. 
0 


It seems, however, that the Ко integral of every order has to be determined directly. Since 
the evaluation of this integr. 


al becomes difficult for n>3, the present approach gives, Ш 
finite form, all absolute moments up to the trivariate normal distribution and also the 
Separate contribution of every quadrant, and similar odd-weight results (1-- ою -+n -+ odd) 
for the four-variate case, 


3. EXPRESSIONS ror [6, m, n, ...] 


3-1. General theory 
Its of $2 are confi » Yivariate and trivariate case f 
and the separate contribution 0 
multivariate case. To avoid com- 


| f late case is considered, but, as can be seen from the 
thod is quite general. Let 


» only the four-vari 
derivation, the me 


[оу (rs = 1,2, 3, 4) 


teristicfunctionig exp(— 15 — Ур, „1.0 


П of the positive quadrant 


ШЫ = | Мамам: 
0 


ШАРТ 
Ш] 1799 23 24 (27) ] TPE- шн, шада) da 
“Ды 
|| 21222524 || “PL 4283540) 


Ф 
x (уз (- пи Pie Ph pt pe ty 


Я РФ 1ш Reps sos кнн) а jd 
= EXXXYy (- ци ЕТА ры pi, 
21417151 tlut LS 
Where Nass 1 Ma ә | (40) 
2g 6 ([` Е Oey а) dis; (4 1) 
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An exactly similar procedure gives the absolute moment 


eo 


л) = [Ts |e e s рд а 
-YXXYXXXX — qmtotrastits Pls Ps Pla ар р 
БЕХУ 1)рза + pigirls!t! a! рід Hi piss Назви Hia rte 
| | А (42) 
whe Гада 
re y= | etf | Peata) ae. (43) 
To evaluate И, ; and Н, ; we note that 
1 ге 1 1 d) 
ы j -М?-Их ia? 
ЧЕ й#ечё-шаб= amar 
and therefore — 0 (gae 
А a n (44) 
The repeated integration by parts gives 
<j: К; = (- МИ) (2-14-09 
=0 ()-! even) 
aminti] А Hm 
. Ange (1 odd). T 
(т)! 2М-4- Fe} 
12): | : = ыла 
j Құт Cien gnus -i eM dæ 
(=i q auo p( 277 2+ c 
(46) 


RUSO ee ee 
g Gr) 0—7)! 
Tn the case of H, ; it is easily shown that 
Н,-9 (j odd) | 5 
= 28; (j even). ИО 


3-2, Illustrative example 


2 2 И MUSS 
П, 1,1, 1] = (Rs ы бы) Spy + BEB зд, 
F, (Е.Е Fs)? 08,0)? 
+ Е бабы 5 Spi қас Gur Xpapu — Бай ы yg, 
(48) 


m 


Where 4, j, X, 1 are all different. The 
replacing В, , by H,,;- The substitution for F’s and H ’s gives 


absolute moment (1, 1, 1, 1) is obtained from (48) by 
[1,1,1,1] = 2 ats Бро + aa BP Š Брура: + ts ХрдРи 


ig; PtP = 1л? 1 р рй d 8л 1 урурьы tong — Epi 
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4 2 4 2 2 
and (1,1,1,1) = тв (1 250? + рур ру t d Хр + 3Xpi;piy 


EP is р + Ур ур рыр ...). - 
= 0 reduces [1,1,,1,,1,] to the integral [|| Е 
see Moran, 1948; Kendall, 1941, 1945) 


It may be observed that ћеђеђе=ђ 


taken over the positive quadrant ( 


4. APPLICATIONS ТО THE DISTRIBUTION OF A “TOTAL ERROR? 


4-1. The moments in special cases 
Let us now consider the distribution of the tot 
confining the discussion to two or three norm 
variances of the component errors will in gener. 
to the consideration of the following stand 
preceding theoretical treatment, the 278 ha 


Case: (1) a,+¢ [88]; 


(iii) 012 + 6, | ж | + es | та |, 

The distributions of d 
with zero means; c, с;, 
dealing with case (iv) b 
The first four momen: 


al error 7’ mentioned in the introduction, 
al and mod-normal component errors. The 
al be unequal, but the problem can be reduced 
ard forms of 7, where, in conformity with the 
ve all unit variance: 

(ii) || сіз), 

(іу) e [а [es | 2% | + es | a. 

ж and 21, %, a are bivariate and trivari 
са Не between —1 and + 15 в: = 

elow to shorten the writing of mo 


ts of Т calculated with the help of the absolute moments are given 
below for cases (i), (8) and (iv). In the last са 


8e, the moments и; have been given about zero 
and not the mean, as the expressions for Из and jt, are very lengthy: 
Case (i. Т = ад + 0 | 2, |. 


ил = с 2 
Из = = 


Ва = 4-3), 


ate normal respectively 
l but is retained as such when 
ments. 


T, 


Ape se yl А 


Иа = 3[(1+c2) 
КА ЛП 


Ш 


Из 


4 
2. 4c?p?] = [3c2(1 + 292) + сї]— ud 
Case (ii), T = лї 


ў 2 
b= = 
йү NEUEM 


ж Marc 
2 
NEMUS. ы we оца. 
Ha = 3-9 ер 


1), 


2 іп-1 
0") +рвіп-1 p)], (80) 
Sing. ар 

= (1— с?) Du о ааа з 


+4 
+ (2) паса +» ке (МА — р?) (2+ р?) 


in—1 
WAP") раа) sn +0), der 
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Case (iv). T = с, |a, | es | vs | + cs | |]. 


Za № 
A= Re 


Its => ays 


= Xe;cj(4 (1 — p1;) + p; sin! pij), 


9 
= [o xe Sete +) 


AV 
=) 6c > А Pij — PikPik 51 
40) ШЕШЕ ый (у /a — pi) t (ЕСІ m 


дщ = 3Xcl + 6XcFc3(1 + 20) 
"es = (Ed ДУ — 93) (2 + p) + 3p; sin ps] 

+ расу сре (рај + рь) $n Рај + A. — pi;) (1+ Pin рўь))]} 

(i,j, k = 1,2,8) 
Where w, used in the expression for i5, is the determinant of the correlation matrix of the 

trivariate normal distribution of 2, 25, 2. 
4-9, Derivation of probability integrals 

To consider the distribution of 7, we note that the probability density functions for the 


first two cases are, respectively, 


P(x, | ж» D =p(% y) = 72,9; p) fy: -р) (—c0«2 «00,0 «y <с0) (82) 
v 

on pilal | га Do 65.9) = 2 аһ) 0) + Sey — РЈ (<, <>). (53) 

Tite fis. ys p) = (рта Q1 — pP) exp (7 0а +: — Эр) — 9) 

Case (i). T = 21 4c | vs |. 
The probability integral may be written as 
Pr(T «d & Pr (z-- cy <d} = Л f(x. у; p) азау+ | Та, у; — р) dedy 
х+су<@ х+су<@ 

= (4+5), where—00<2%<00,0<y<o. (54) 


By appropriate linear transformations the integrals 1,1, шау be expressed in terms 


of functions of the type 
A(1-p2)- МЕ. [- = (а2--/2- град) атау. 
pk) = 20) * Q—-P T^]. 30 —p) 


k 
This function is tabulated in Karl Pearson’s (1931) Tables for Statisticians and Biometricians, 


2, Tables УШ and IX. Thus, for d> 0, 


T= ф-р, (0, = (p-c)) (1 ++ 2p) di = d[A (1 ++ 2ep), (55) 
I пе А 
and for 4<0 (4--4,4>% 
тор „фый. Prix (раме, duo О +0 + 20р), (56) 
p Wf em 


1, is obtained from (55) and (56) when p is replaced by –р. 
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Case (ii). T = |а | + | os |. 
Р{Т<й}=Рг{в+су<4) = 2| [| Да, у; р)ахау + || Ја, y; -р)4ғ4) | 


z+ey<d т%су<4 
= 254-1] (0<7 <). р (57) 
Here it is necessary to consider с > 0, c< 0 separately. 
€» 0: I, = р,(0,0) — ps (di, 0) *p5,0) py = (p+ c)IA(1 + e24- Зер), 
Ра = — (1 еру ЏИ +е8 + әср), а, = d[A (Y + с? + Me 
€ «0 (c = —c' с> 0): dn 


6) d>0: I, = p,(0,0) "Pads 0), p, = (рс) +e 20р), d, = ас 2р), 


m e" 
ü) 4<0(4- -Ф,Ф>0): 

1-2,0, py = (ерле oet) 

The corresponding expressions 

Case (iv). Т 


Specific values of Cy, са by numerical 

» б; Tm 
; belong and tedious. Alternatively: 
may be fitted to give an approximation 


B f e 
ed 2: Жо, Xa W. vestigated in sont 
b Ya 2; Was investigated i 
st four moments of T are | е" 


, 2 
№ = ЈЕ за, 


2 (4 (5j 21,23 (61) 
figs ЈЕ ]- , 2,8). 
a E | (уа, 
= 4 19 
Щщ = (5-3 ха ( 912 
iT 6[1—2 
Betting = 1, 1 ы >) Хас 
442 
- 1 =з 
d 
= ce 3 (die 
(-3 (Edgy 
т, 
TE е 
Amie TL T sets 
1~£)" 1+ 
Table 1 givi Pis fts айр, ( 1) Са + сз}? 
А 3 P9 
rima lie in the restan pe ты Mes of c, and ¢ Iti ) 
© posit: 5 ls * 101565 
: 4. ї the Points does wi ^ 3« f, On the 4: from the table that the (Bubs 
2 “s ate Negative it ia e = Very my snes agram. When ср, су are ров! | 
T & М > к 
9 Type y region and 0и In ће Type Т region | 
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4:3. Adequacy of Pearson-curve approximation tested in special cases 


That a Pearson curve may often give a good approximation to the probability integral 
(possibly even in the correlated case) is illustrated by the following examples. 

Example (1). For the uncorrelated case, with 7 = | 2, | + | 2, | it is found from (61), (62) 
by putting с, = 1, c, = 0, а = 1:5958, 0(Т) = 0-853, f, = 0-495, fj, = 3-435. The upper and 
lower 5 % points for the corresponding Pearson curve can be obtained from Pearson & 
Merrington’s (1951) table; these values are compared below with those obtained directly by 
the use of the formula (58), using inverse interpolation: 


Lower 5 95 Upper 5 % 
‘True 0:38 3:17 
Pearson fit 0:43 3:17 


Table 1. Values of f, and 8„— 3 (bold type) for the distribution of T = | а, | Ев, | 2, | + са | 25 | 
when 2, Xp, Фу are uncorrelated 


| т T 
| 1 
| с. 
e „КЕШЕ; | —06| 04 | –02 | 00 o2 | 04 | 0-6 0-8 10 |% E 
ы | = | | 
| | ей | 
| —10 |0-0367 |0-0141 | 0-0035 |0-040 | 0-0°75 | 0-000 |0-0°75 |0-0240 0-0035 |0:0141 | 0-0367 | —10 
290 | .301 | 332 | -377 | -418 | -435 | -418 | -377 | -332 | -301 | -290 
| 
-0 E .0092 | -0305 | -0481 | -0535| -0514| -0518| -0614 | -0836 | -120 | -08 
d 048 | 0092 | Mes | 435 | -455 | -435 | 385 | 334 | -304 | -301 
-0 (698 | -146 | 217 | -242 | -226 | -203 | 105 | 208 | -240 | -06 
и 10628 | ds | 502 | -531 | -502 | -435 | -370 | 334 | 332 
-0: -32 -494 | -556 | -511 | -431 | -374 | 356 | 368 | —04 | 
bU 324 .620 | -663 | -620 | -524 | -435 | -385 | -377 
~0-2 .761 | -867 | -786 | -639 | -527 | -473 | -463 | —02 
P 1748 | -805 | -748 | -620 | -502 | -435 | -418 
0 0-991 |0-895 |0:718 | 0-582 |0:518 | 0-495 0-0 
Ы .869 | -805 | -663 | -531 | -455 | -435 
811 | -659 | 541 | -483 | -470 0:2 
фа .748 | -620 | -502 | -435 | -418 
-548 | -462 | -422 | -419 04 
м 524 | -435 | :385 | -377 
| 
399 | -370 | -370 06 
56 370 | -334 | 332 
342 | -340 08 
28 304 | -301 | 
+330 1.0 | 
50 -290 


Example (9). The mean successive difference with the sample size three, i.e. 


T = M|2zi— 22 |+ |, —% |}, 
is an illustration of a correlated total error. In this case it can be shown using (50) that 


3 gi c» 151886, a) = 0610, В, = 1036, „= 4296. 
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The lower and upper 5 % points for the corresponding Pearson curve as obtained from the 


Pearson-Merrington table are compared below with those obtained for the exact dis- 
tribution by using (58): 


Lower 5 % Upper 5 % 
‘True 0:27 2-40 
Pearson fit 0-28 2:40 


Example (3). The distribution of the mean deviation has been obtained by Godwin (1945). 
Considering the case of the sample size three, we have, using (51) 


Щщ = 06515, с = 0-942, р, = 0417, fy 3-286. 


The following is a comparison of the actual percentage points with those obtained from the 
Pearson-Merrington table: 


Lower 0-5 9% Lower 5% Upper 5% Upper 0:5 96 
True 0-052 0-166 1-276 1:703 
Pearson fit — 0-009 0-157 1-270 1:696 


Tt is to be noted that the last figure in the second line is not reliable since the Pearson- 
Merrington table gives only two places of decimals for the standardized deviate. 


Before concluding I should like to express my grateful thanks to Dr H. O. Hartley for his 
advice during these investigations and to Prof. E. S. Pe 


arson for helpful suggestions in the 
writing of this paper. 
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ON THE RANGE OF PARTIAL SUMS OF A FINITE NUMBER 
OF INDEPENDENT NORMAL VARIATES 


By А. A. ANIS axb E. Н. LLOYD 


Imperial College, London 


The properties of the range of the partial sums Уу, Sa .... Sn of the independent variates Жа атса Ж 


Bis: ; e ~ Е d 

(реп tni in the theory of storage capacity. The asymptotic distribution, for large n, is known 

fie er, 1951). The present. paper discusses the mean value of the range for finite n, in the case where 
€ X, are standard normal variates. This is shown to depend on the integral 


= 


à nis 
enr | ems] exp— Hyp Un JY а T V Њу da Фу 
0 0 


эше value is found to be (r+ 171. 
he expected value of the range is shown to be 


эу п-1 
(9 2" 
п 1 


1. INTRODUCTION 


E — investigation was suggested by the problem of planning the storage capacity 
eservoir, where one would like to know the distribution of the water level over a given 
number a of years. The level after 7 years may be regarded as the sum ofr annualincrements, 
SR we may approximate this real problem by the ideal one where the annual increments 
ате independent variates with a common distribution. Applications to other storage 
Problems are obvious. 
» In the case of very large n the asymptotic range of the consequent random walk has been 
tScussed by Feller (1951). In the present paper we are concerned with an exact formulation 
Or the case of finite т, taking the increments to be standard normal variates, and for this 


e 
ар We obtain a simple expression for the mean range. | 
" The paper falls into two parts. In the first (882-5), the problem is reduced to that of 
Valuating the integral 
- ЖЕНЕ Ed 
а: (ЕЈ «| exp) X У а“ 29] dy + dy,. 
0 0 | 1 1 
to have the value (4/(27)) (r + 1)73, whence the 


І 
1 the second part (§6) this integral is shown 
found to be 


ex à 
Pected value of the range, over n years, 18 


[eee 


2. STATEMENT OF THE PROBLEM 


We Consider n independent variates pe cm each with the same probability density 


not: 
nction (x), and their partial sums 
үу, S, = Хы Х.Х, (ғ-1,2,..., n) 
а U, = Max(S) and 1,- М (8/5 
и т 
T 
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then the mean range is &(U,— L,) = &(U,)—6(L,). Let f (к) be the probability density 


function of U,, and let F (x) and 1— G, (x) respectively be the distribution functions of 0, 


and Би, во that Е, (=) = Рг(0,<ш) апа G(x) = Pr(L, 22). 


n 
We have Fly) = ]- (n) afi plx) dx; 
їду, =. 
1 
7—1,2,...,n 
n 
and а, (у) = је (т) ...| П p(x) dx. 
Баљу, е 
т-1,2,...,п 


This latter may be alternatively written in the form 


6,(-у) = [ж (n) ... n $( —2,) dt; 


corresponding to the observation that Min {S,} = — Max ( — S,}. The functions И, and б, are 


thus integrals of the same type with identical boundary conditions, If ф(а) is an even 
function it follows of course that Ку) = G,(—y). 


3. &(U,) As А LINEAR FORM IN THE /Ғ.(0) 
If we make the transformation 
T 
аң ку or ф = )-і- У» 


the above expression for Е, becomes 


Р со vo m 
str) = | 00. TE Sora) dye 
It will be convenient to define В (=) =1. It is clear from this formulation that 
Е, (о) = Г Фо — у) Р, (4) Чу, (91) 
whence, on differentiating and then integrating by parts 


Јави) = FO y) + [^ у, 
0 


=) fai) у. (3:2) 
Using this as a reduction formula we are led to 
where ыбы) = ХР, 0) з), (83) 
һи) -f of diis 
0 0-2) Фу — ya) “Фф, 1- 
and Ау) g(y). 6 317 V) ty) dy, ey, (pm 1,9,...,7) 
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The expected value д, of И, 1 is equal to jū af a (2) dx, and, using (3-3), we obtain 


Кат = x q, F, 0), (3-4) 
Where 4. = | ^ обод. 
Since | Wo fo — Yr) Wo = 13 


it is easily seen that 
со со 
1 =Í eo) 2 yid — Yo) ... 937 Yr) PY) Фул dy, (r— 1,2,...т), (3:5) 
0 
with Ф = 0. 
4. AN IDENTITY IN THE DISTRIBUTION FUNCTIONS 


If we consider a particular S, we have 


5, = S, (XX +... +Х ы) = S,—T,say, (i= 1,2,...,"— 1), 
ч балы Ха tX) = +, say, (j= 1,2,...,n—r). 


If S, is the greatest of the partial sums we must have 
HO (= 1,2, a0%— 1) and 2;<0 ()-1,2,...,т-?), 
12 ‚2, 


and, in particular, Min {7}} > 0, Мах {В} <0. Now Т, and В, ате independent partial sums 
i j 
80 t 
is Pr (S, = Max {S}) = 6,0) 7,40) 


Further since some S. must be the largest, the probability being 0 thatany two partial sums 
2 r 


are equal, we have E 
X а. (0) F, (0) = 1. 
ті 


Where ф(а) is an even function this becomes 


y E (0) £,.,(0) = 1. (41) 


т=1 


" 
5. THE SPECIAL CASE OF NORMAL X; 


We 1 1 the X, are standard normal variates, so that 
now ecial case where i А Лам 
$(u) = өч: ти —ud(u). Applying this latter result after differentiating (3-1) we 
Bet, with a slight change of notation, 


рый = i * (eu) oly 2 Ба. 


i We multiply (3:1) by y and add to this the last result, we find 


анты = |, 007 Fale) de (5-1) 
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ri i he 
Taking п = 0,1,...,r—1 in this formula, and applying the results to the reduction of th 
expression (3-5) for 4,, we find 


q -f (70 [^9 y) dt = уд Bl ts iy 
т 0 4 


+... +| “ы-і Qi Yr) BY) dy, 140, 
0 0 


со 7 5 5.3) 
+f f, lun) dy, + | YL AY) ФО) dy, (5 
0 40 


= T om integrated 
The last term in this expression equals -| 2,100.) P(Y) dy, and this may be integr 

0 
to give 


(0) 810) + [иде 


5 А за лев enbatitute for 
This last integral is identical with the penultimate term of (5:2). If in this we substitute 
fray) according to (3-2), the integral becomes 


fad dy, = қ. [^ феуду, + [7 [7 aU BYr1— у) Феј ду, dle 
% 0 0 4.0 


The last integral here ag 


айы с . а inuing in this wav we finally 
am coincides with a term of (5-2). Continuing in this way we fi 
obtain for (5-2) 


у 5.3) 
t= XsbF (0) (ғ-1,2,...,п), (9 
where з=1 
eo © 
b= |; (s— yf’ (9,2) 9I, ыз — Yrsa) e Oa =Y) Ф(у„) dy, за ++ ЧУ, 


(5 = 8,4... n) (5% 
with 8-40) and b = Г $y) dy. 


Using the expansion (3-4) for База We obtain 


n r 
Амы = У > sb,F. (0) FE, (0), 
7=18=1 
which, with the aid of the identity (41), reduces to 
n 5 
Ка = У sby ж 
5=1 

" у . "am 
| The problem is thus reduced to that of evaluating the bs. It is convenient to rewrite (9 
in the form 5-6) 
b (mro. (8-1,2, «es fl (2 
where, explicitly now, 


=l, qe | edy, 
i = 0 
ш! ој, шына ма Секен (s = 2,3 


2 — 
Әх 


6. EVALUATION OF THE с 
8 


A>0, 
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where А is a vector of order k, means 


ді>0 (0 = 1,2,..., К). 
We write (5-7) as 


e, = Ге e 353 dy, ... dy,, (6:1) 
Југо 
where А is the (r x 7) matrix 

8-і D as 

= 2 – 0 
0--і 8-і 0 % 

А- 

0 0-1 2-1 
0 0-1 2 


This matrix is symmetric and positive-definite, and may therefore be resolved into tri- 


angular factors: А=тт 


Where T is upper triangular. It is easily found that 


The exponent in the integrand of ¢, in (6:1) thus becomes 


у'Ау = y'T'Ty = u'u, 
Say, where и = Ty. 


Let us make this substitution in the integral. The J acobian is 


ЦА|- 1A - 1). 


(r41)e = |, (r) [em Um (6-2) 


7iu20. 


Then (6-1) becomes 


ay be written in the form 


u-Tz (220), 


The region of integration m 


Or, Slightly more conveniently for our purpose, as 
u-Lx (x20) 


Where г = Та 
= (Ay Ae: m Sr 


(6:3) 
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and the columns 2, of L are defined explicitly by 


21-(1,0, |), 


г 8—1 $+1 : 28 4 
%= (es = f [PEA 0.) = 2,8; ои). 


Here 0; denotes the zero vector of order i. We note that the A; are normalized, i.e. 


AAg=1 @=1,2,...,7). 


We now introduce a vector A, 41 defined by the relation 


МА ФА = 0. 
It is easily verified that we then have 
ХАа=-% | 
5 р Lo 12,5: 1). 
Мин (j»1) ti "M 


_ the suffices being reduced modulo “+1 when appropriate, and that the new vector 218 
normalized. 


We now introduce the following (r x r) matrices: 


L, = (Ag, Az, 52354) 
L, = (Ag, Aq, по Artis Ay) 


Lis = (%, Ao, te ,); 
the last of these being the matrix L of (6-8). 


It may be seen that 
LiL = LL, =... = Lisa (= 3A), 
whence the І, may be shown to be orthogonal transforms of one another; for let 
к. PL, =L; (+); 


=I, 
Since the inte 
grai 
therefore describe t ogonal transformations of u, we may 
$ 
g,: 


any of the alternative forms 
s u=L,y (у>0:; — 1-9 


те, fo = [оу [ os, 5 
where in each case the EN is Je (6-2) №] Р " à e | : | 


ут), 
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We now prove 
(i) that the r+ 1 regions 2; are non-overlapping, so that the sum of integrals in (6-4) may 
be replaced by a single r-fold integral over the union of the #,; and 
(ii) that the regions 2; are exhaustive, so that their union is the whole space. 


Integration is then immediate. 
(i) The regions are non-overlapping 
Consider, for example, the regions 2, and 2,. If uis any point common to them, we shall 


have 
u=L,x=L,y (х>0,у>0). 


Then Lax = Щу, 
or, explicitly, 


1-4 ©. 0 => 6 = 

=f Шеф D 0 1-4 0 0 
бб eg х=|-% 1-4 0 y. 
0 б = 1 ала ТІР ый 


Тһе solution of this set of equations is 


а= 2-0. (= 23er) 
а = — Yr 
Sinco by hypothesis the elements of x and y are all non-negative, it follows that 
а = 0, = 0. 


(r — 1)-dimensional subspace, whose measure in r-space 


Any common points u thus lie in an 4 
ally, we may say that the regions have common 


ч of course zero. Speaking geometric 
Oundaries but are non-intersecting. 
A precisely similar argument applies to 


e, m (E уз fiy fem ong tls (6-5) 


the regions #;: 


all other pairs of regions R;,#;. Hence 


Where the region of integration Ris the union of 
rtl 


R= U Be 


(ii) Ther+ 1 regions #; are exhaustive 


We have to prove that every r-dimensional vector is of the form 


u-Ly (y>0), 
for some r. Now every set of r of the basis vectors А, is linearly independent, so that we may 
а, % 
а» pes a= 14x = ky = = LAW, 
= any u, witl iate values of X, y... W- Let us write these representations in the 

» with арргоргі 8 ys 

1 т 
ж и = А. +... Ана ын» 
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i t if i ;hese 
Our proof is by induction. Assume for a particular value of s that, if in one of t 
i ici ссеед s ies in one 
representations the number of negative coefficients does not exceed s, then u lies sa "e 
i 1 iv a las re 
the regions Z;. Now consider the case where in some representation (say the last) there a 
= 


ў Alty ' are 
s+ 1 negative coefficients. Suppose, as we may with no loss of generality, that these 
Wy, ..., 00544, SO that 


w= (%=1,2,...,8+1), 
and шу= +оз (ј==+8,...,7), 
where all the w, are non-negative. 
We replace A, by —A,— ++ —А„. The representation then becomes 
Ч--А-..-0 Agar + We oAgiot ...+W,A, 


= (04 — о) Ag+... 


+ (Oy ол) Agia + (Oy + O59) а +... + (v, + 9) A, +O Apia 
This is a representation of u in which the only coefficients that c. 
those attached to Хоз 
initial assumption were v. 
to one of the regions Z,. 


. А е 
an possibly be negative at 
; lei e, if our 
өзі» Thus there are at most s negative coefficients, whence, if o = 
ili ; ; ; j 1 
alid, it would follow that the point under discussion must belong 


Now the assumption is certainly valid for s = 0 
в = 0, 1,...,v. Thus any r-dimensional vector whos 
sentation do not exceed r in number, must belong t 
clearly exhausts all the possible sign variations of t 
Tt follows then that the union of the Z; is the 


, and is therefore true in general for 
е negative coefficients in some repre 
o one of the regions 2,, and since this 
he ғ coefficients our lemma is proved. 
whole space. Hence (6:5) becomes 


+o +o 
("Тс = | ој edu, ...du, 
-0 


-0 


= {| eau)! = (J(27)yr. 
Referring back to (5-6) we have É 


and finally, by (5-5), 


In conclusion, we note that 


: the asymptotie value of the mean range, for large ?^ i 
2 y () n*,in agreement with Feller's results, 
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NOTE ON ‘THE JACOBIANS OF CERTAIN MATRIX 
TRANSFORMATIONS USEFUL IN MULTIVARIATE ANALYSIS’ 


By INGRAM OLKIN 
Michigan State College* 


SUMMARY AND INTRODUCTION 


General techniques for the evaluation of the Jacobian of a matrix transformation were 
outlined by Deemer & Olkin (1951). In particular, (а) the Jacobian of a non-linear trans- 
formation is equal to the Jacobian of the linear transformation in the differentials, (b) by 
the introduction of suitable variables, the Jacobian is equal to a product of Jacobians which 
ате easily calculated. This note is concerned with a number of transformations not con- 
Sidered in the earlier paper. Certain of the transformations considered will be recognized as 
extensions of those given previously, while others are quite different. The notation and 
results of the first paper are assumed. АП matrices in this note are square of order p. 


THE JACOBIANS OF MATRIX TRANSFORMATIONS 
Turorem 1. The J of the transformation 
Ү-ХА i D(X; Ў) = П ae, 

1 


Proof. уз = X Xima >j) The scheme of coefficients is triangular with diagonal elements 
к=) 


Arr (P— k+ 1) times from 


дук (= Hi; фу Жез 1,900): 


H j "m ТЕТ 
ence the determinant is Шай "7". 
1 


Тнконвм 2. The J of the transformation ч 

ў=Х'А+АХ в DX F) = 2 Пай. 
Proof. Pre- and post-multiply by A’ and A-, respectively, and let Z-À4'3YAA, 
= Х4-1. Then Z = Ü +0' and 


D(X; У) = D(X; 0)100; 2)0(7; Y). 


We can now evaluate each component 
p 
41 ~. 7 
II ағ % DU; 2) = 


% 


D(X; 0) = П ад", DRY) = |4 "= 

1 

Совогљаву 2. The J of the transformation u " 
p-T в DT; Р) = Whe 


als of V = ТТ and making the correspondence (dV) = Y, 


2, the corollary follows. 
University of North Carolina and was supported 


Proof . Taking differenti 
(ат) = X, T = Д, of Theorem 


* 
in The work reported here was С 
Part by the Office of Naval Research. 


ompleted at the 
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THEOREM 3. The J of the transformation V = D, RD, where r;; = 1,18 


vz | L D, 


where Р,:рхр and Dy: P(p—1)/2 x p(p—1)/2 are diagonal matrices with elements 


ә 9. p * А " Е i у у 
235. лг Dn and Tito, TiTa, ..., 2, 10, respectively. The matrix L does not affect the valu 
of the determinant, which is equal to 


D 
2 Па; П a,x; = 2» П 27. 
Y dej 1 
THEOREM 4. The J of the transformation ñ = D,Ü, where S из, = Mi = 1,..27) is 
г. 1) 
Етегі 


Dx, 0; Т) = Паш, 
1 


Proof. и== E , 
f. by = Su = ту 1 — P в) > by = ии, ij). 
Hence the scheme of coefficients is Е 
2% Uy 
КЕ: БЕ 
biz | D, -Е, 
ің E, D, 


where D,:pxp and Dy: p(p— 1)/2 x р(р—1у/э 
Мур... Upp and 2, of multiplicity (4- 1), (г = 1 | 
дуда = vj = 1,2, ...,4—1), E: и i 
Bly (Я =, ат 12 Р(—1 


чу Р nts 
are diagonal matrices with elem? 


5p), respectively, The latter arise 4 
)/2 has ele i 


ments v.u. Ју, arising 8 

ш) arising from дуда, (ј = 1 ә?) and zeros elsewhere, 5: e o и our element 

таныл Алы ВЫ Фе Т E deter 
of partial derivatives ig ; өзе?) and zeros elsewhere. The 


4-і 
D t p u+ 2 
| „|| Di E, D; ЈЕ, | = Па (4-5 “i M 
т erem || | atlas 


R=00 


DO; В) = Па 
1 1% . 


› 4 $ 
»wherer,, = 1, Dw, = 1,( = ПИ. 
ізі 


roof, тие 126 5 
ИН Азышу(6<) 
Ті 24<2).Тһе sch, 
elements ty of mu] tiplicity а еше of Coefficients і 


^ arising fr 
minant of partial derivatives ig 8 from ду, Otis (j 


= е e 
frase 7 L..,i—1). Hence the d 
Tw, 
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Тнвовкм 6. The J of the transformation У = (A X)! (A — X), where А is a matrix of 


ER D(X; Y) = 2| Ao A X |. 
Proof. Taking differentials of Y = (А+ Х)-1(А-Х), we get 
(ау) = – 3(4 X)! (dX) (A *- X)? A. 
Using Theorem 3-6 of the earlier paper, 
D(X; Y) = |2(A+X)7|?|(4+X)74 |Р. 
THEOREM 7. The J of the transformation 
= (1+Х)-1(1-Х) в D(X; y) = 2ир+ | 1+Х |-Ф+ї, 


Proof. Taking differentials of У = (1 + X)3 (I-I), we get 
(aF) = -20-- X)? (dX) (1 LX) 
Using "Theorem 3-7 of the earlier paper, 
D(X; У) = | мане |n. 
= (А + X3 (4 – X), where А is a matrix of 


Тнковем 8. The J of the transformation 
Scalars +0, is 


D(X; У) = 25032 | A -X | e Пана, 
1 

Proof. Taking differentials of ў = (А+) (А-Х), we get 

(7)--%4-Х)- 1 (dX) (4+ Xy. 


Using Theorem 3-8 of the earlier paper and Theorem 1, 


уе ane t+1, 


D(X; Y) = П оца HC)” (liit Vii 
i 


Тнковим 9. The J of the transformation Y = 7D,7", where і; = 1, is 


D(f,a;Y)- frat 


sd Taking differentials of y = 7D,7" and pre- and post-multiplying by 7-1 and 
1, respectively, tum gra = т-а?) р, + (0D, )+ рату 1-1. 

Let «татты, 1-40, 0-17), 

7 = UD, +D, + D,U' 

= рат, da; 0, dw) D(Ŭ, dw; 2). 


fficients for D(U, dw ; Z) is 


Where иң = 0, then 
and D(f,s; У) = = рат, їх; dY) 


^ ju and hence the scheme of coe 


ii = 5 and z;; = и; 
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where D: p(p—1)/2x p(p—1)/2 is diagonal with elements x; of multiplicity (p—?) 
(2 = 1,...,p). As the matrix L does not affect the determinant, we obtain 


= n 
D(Ü,dw; Z) = [Dl = Па" 
ә р. 
рат, ах; Ü,dw) = | T | о) үг = 1. 
| t 
Тнковем 10. The J of the transformation Y = PTT’, where Г is orthogonal such tha 
|T+Z|+0, is DS; Т; У) = 2»»-w2 | I+ |w- П (a= ty) 
i-j 


where Г--(1 +) (I-X). 3 
Proof. Taking differentials of Y = ГТГ”, and pre- and post-multiplying by Г' and 1, 


respectively, I"(4Y)T = ГГ) T 4- (4T) + Т(агу Г. 
Let W-r(Y)r, $=Г'(4Г), 0 = ат, 
then ми ВР 

(1) У = 57+0 – 78, 


(Note. Г' (а) = Sis skew-symmetric, hence s= —®)` 
DÊ, T; у) = Dakar; ау) = раХ.ат: 6, 0) DS, Ù: W)(W; d¥)=D,D,D» 


D, = зио-у | 1+ X |--0, Dj 1, 
D, arises from (1), and the scheme of coefficients is 


Yii ог 
%%<)|01 M, 
y4«j)|00 N 
the determinant of which is |У |. Ni 


ва triangular matrix with di 
(1--)) arising from дуу = by — 


agonal elements ¢;;— tii) 
tii where 


D 1 
Уш = У Siti У, syst. 
k-j k=1 


Hence || = П (0—4). 
<j 
CoxNcLUSION 
In this note, the Jacobians of cert 


i ; > а. 
Theorem 2 j ain matrix transformations have been evaluate’ 
em 2 ‘ 


8 
of the Toeplitz factorization which ч 


heore 


P» orem 1 "e 5 йені; ield5 
the characteristic roots of a matrix is тайы” the Smith canonical form which y! 
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ESTIMATION OF A FUNCTIONAL RELATIONSHIP* 


By D. V. LINDLEY 
Statistical Laboratory, University of Cambridge 


It is known (Lindley, 1947, § 7-2) that if there is a linear functional relationship between 
two quantities, if the only observations available on these quantities are subject to error 
andif all the distributions are normal, then there is no satisfactory estimate of the functional 
relationship unless the relative accuracy of measurement of the two quantities, or some 
Similar knowledge, isavailable. Furthermore, in this situation there are three relationships 
Which exist between the two quantities (either with or without error), namely, the original 
functional relationship and two regression equations which, in this case, will be linear. The 
experimenter who encounters this situation is therefore faced with a difficulty and a com- 
Plexity in the mathematics which he feels is foreign to his practical situation, and it is not 
therefore surprising that numerous attempts have been made to remove the undesirable 
features of the problem. The fact that the experimenter finds difficulties which he cannot 
ematical model is unsatisfactory. A new model has 

and with this model he claims that the estima- 
tion ің possible, йан there is only one regression and that this regression has the same con- 
Stants as the functional relationship. Kendall (1952) has raised some doubts about Berkson’s 
Procedure, and when I read Prof. Kendall's paper before publication I thought I agreed 
With him. Further consideration has revealed that Berkson’s procedure for estimating the 
functional relationship is sound and his mathematical model, incorporating the idea of 
à controlled observation, is likely to be of very wide applicability. In the present paper 
ie à mathematical justification of Berkson's procedure which may help to clarify the 

sue, 


et interpret suggests that the math 
^erefore been suggested by Berkson (1950), 


As a compromise between the notations of Berkson and Kendall random variables 
(Variates in Kendall’s terminology) will be denoted by lower case Roman letters and other 
quantities (mathematical variables) by capital letters. Any random variable or quantity 
Which can be observed will have a prime attached to it. We now postulate the existence 
Of two quantities, U and V, which are connected by a linear functional relationship 


V=a+fu. (1) 


Nothing is said about how U and V are to be obtained; they may be obtained by a random 


Process во that (1) may equally be written 
v=atfu, (2) 
a random variable so is the other. In the problem 


but, in view of the relationship, if one is 4 у 
able; observations are only available on 


Under consideration U and V are not observ: 
xv = ud, (3) 
y =®+е, (4) 


* See editorial note on р. 49 below. 
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" i tims Ж ^ several 
where d and e are two random variables. The problem is to estimate z and б from 
pairs of observations on >” and у’. If we now assume 
(i) that u (and hence v) is normally distributed, 


(ii) d and e are independently normally distributed with zero means, and 
(iii) d and e are independent of u, 


, , will 
then estimation of æ and / is not possible and the regression lines of у’ on x’ and z' on y wil 
not be coincident and neither will have the const 
estimation does not seem possible even if ( 
This is the difficulty referred to above. 
Berkson queries the formulation (3) 
that we have a true value u (or U) w 


+ А : ‘he 
ants of the functional relationship. 14 
i) is omitted and U is not a random varia 


and the assumption (iii). Taken together these imply 

hich we are unable to measure directly but in -—. 

а measurement оп и we introduce a random error d which is unbiased and independent of i 

so that z' is of necessity a random variable, Berkson calls z' an uncontrolled disi | 

He suggests that observations are not always uncontrolled: often we bring a при“ | 

to a value determined beforehand (whether or not by а random process will be seen n0" | 
| 
| 


ity used in 
matter) so that our measured quantity is fixed but the actual value of the quantity useC © 
the experiment will differ by an error d from the me 


atically ! 
asurement. Hence mathematically 
we take the case where the measurement is not а т: 


andom variable 

, (5) 
и = Х'—а, 

where we may assume d to be nor: 


тв е -us sign 16 
mally distributed with zero mean, and the minus 819 
introduced in order to show the a 


nalogy between (3) and (6). Rewriting (5) ) { 
6) E 
Х' = и+а, | 
П {8 
and u and d are no longer independent, Х' is called a controlled, observation. Berkson sugge? 
that in many experiments one value is controlled, the other is uncontrolled; for examP 
we may bring the current to 9 


A 
5 “атар. (controlled) and measure the voltage as 3:67 v. p 
controlled). In this case Berkson argues that estimation of g and р is possible and t! 


| 

) 

regression of y' оп X is linear with the Same constant. The proof of these statements ! 
simple. From (2), (4) and (5) у 
У –е= a+ B(X'—4) | 


У-а- X! (е " 
= OG BX' f, (і 


andom variable wit tae 
farand € Ме with zero mean and normally distributed. This 18 Е 
om variable y on a, fixed variable 


4 reg? 
shed and the usua] estimates X, and accordingly the 


p-3w-—y) ox, 
“Зара шап} жы, F 
X(X'— Xe > 0-g9'—bx 


ie. 


wheref = e—fdisar 
ordinary regression о 
sion stated is establi 


Optimum properties, Furthermore, eve? s 
) Berkson’s argument рег m 
have (2), (3) and (4) we? (8 


because f is independent of x’ i t igi 
24 . in 
| of he original 


Y= 0+ fal ау, 
but ?' and f are Correlated, sing 
SIS Ceases to appl 


analogous to (7) 


al 
H “is a 
regression analy, “Жұ ылғы : а сна With 4, and thus the ple 
8 Shows the estimatio impo? 
n to be imp 
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pores the essence of Berkson's idea is to suppose d independent of z' (where I 
ee d = independence in the case where v is a random variable) in contrast with the 
ae шор ion that d is independent of и. The above analysis shows that in Berkson’s 
Сасы а е problem is emsivalens to that of the regression of y' on X’ (ог x’). If X is not 
а variable then Berkson’s procedure can be justified without the assumption of 
8 a ny by the familiar least squares argument and an appeal to Gauss's theorem. 
win E has caused rae in the past it might be helpful to rephrase Berkson's first 
Жаш е j 173). If x’ is controlled and y’ is not, then the regression of у’ оп 27 is of the 
scat : has the same constants as the functional relationship, and the latter may be 
пи у ^a he usual regression methods. If y is controlled and а/ is not, the regression of а” 
ned | cem properties. But note that in the former case there will still be a regression 
БО : = it will not have the const: nts of the functional relationship; it may not 
ir te near. So that Berkson's title ' Are there two regressions? ? is still answered in the 
im) е) ч the controlled observation isa random variable (a case not considered by 
ien didis the negative when it is a mathematical variable, since regression of a 
авна variable has no meaning. Nevertheless, his procedure for estimating the 
relationship, which is quite a different thing, as Kendall points out, is valid. 
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Қ Ек Note. In his second paper on ‘Regression, structure and functional relation- 
Probla endall (1952) criticized certain of Berkson’s (1950) conclusions regarding regression 
па = when dealing with controlled experiments. To these criticisms Dr Berkson sub- 
hess reply which in turn led to more comments from Prof. Kendall. In considering 
hie е contributions as Editor, I recalled how often in Чаробни disagreement 
meth yee quite inadvertent misunderstanding of the other man’s terminology, notation or 
ет Od of approach; to clear the position an independent re-statement of the problems at 
thie iad be needed. The preceding note by Mr D. V. Lindley appeared to be so helpful in 
in theese that I asked the two contestants whether they would agree to its publication 
i Ce of their own explanations. This plan they both welcomed; it is, however, right that 
оша add that, while Dr Berkson finds himself in substantial agreement with what 


bs Lindley has written, Prof. Kendall is not convinced that the points at issue have all 
en cleared. E.S.P.] 
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ESTIMATING PARAMETERS IN TRUNCATED PEARSON 
FREQUENCY DISTRIBUTIONS WITHOUT RESORT TO 
HIGHER MOMENTS 


By А. С. COHEN, Jn., The University of Georgia 


1. INTRODUCTION 


Truncated samples arise whenever observed measurements are restricted to an interval 


2. SOME FUND 


AMENTAL MOMENT RELA 
| TIONSHIPS 
In the writer’s paper (1951), 


i ў of 
moment equations previously referenced, it was demonstrated that the syste 


ОЛЛО 1) 
= 15-55-52), ae: = ( 
where / = a+b, and д is the kth moment o +1 = Miss (k= 0,1,2,3) 


, 


: 100» 
follows readily from the differential equation f the general Pearson frequency func 
l af (x) ne а-т (2) 
Equations (1) differ from th и па ба “of 
hin place of q е usual equations fo: ү 


of course, derived fy T the Pearson System only in the us? о) 
i oth {БЕД ee the assumption that the exP!” з 
titted’: 8 of f(z). F о 
Тех А mited in ) Ног the Pearson curves e? 
ceeds a certain value which depends dm or both directions, this will not be true 
б 
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ЕР 1 he origin from which 2 is measured is designated £ in standard units referred to the 
stribution mean, then , 

иу = —0%, g 


B с is the standard deviation. 
now the non-central moments of (1) are replaced by standard moments аһ where 


Е (E ; 
= () ии) о", (4) 


t 8 
he resulting system can be solved to yield 


h=oa, = 0, = сү, 


= 6+ 328 — 204 (5) 
2 = 2(9 + 628 — 504)“ 
Pus = -£1-259. 
0 = [(1 — 36,)+ & (1 — 4) 2+ 03, (6) 


у = [3/2 — 265 (3—5). 


3. GENERAL FOUR-PARAMETER DISTRIBUTIONS 

ated both on the left and on the right, and if the 
<d, where d is thus the truncated range, the 
d in a manner similar to that by which (1) 


ox o bly-truncated samples. If f(x) is trune 
Sin is taken at the left terminus so that 0 & 2 


ollowi 
пина system of equations may be obtaine 
ав derived: 


h+ 2m,ba +F -F = т, } МҸ 
мт, + Бойтра + by Me + balk +2)mp Е = тыл (2-1,2, 3), 
wh 
P в – бобу F =D [bot db, + dbs) (8) 
g i 8; 7) + 
? is the kth moment of the truncated distribution about its left terminus, 1.e. 
à 
ity = | akfle)de with m, = 1. | (9) 
0 


E Should be noted that truncation does not alter the differential equation. The constants 
ie (7) are therefore identical with those in (2), and may be expressed through (5) and (6) 
rms of moments of the complete distribution. Further details are included in the writer's 


“lier paper. 
T s left and right truncation points in 8 
ed to its mean are designated as £' and £t 


F =02% ша F= с23, 


tandard units of the complete distribution 
respectively, with £” = & + а/с. We can write 
Where (10) 
2 = 
17 LBE P] f — 36) + {А — 4%) 98/2 ЫЛ, | 
= AE јр] (ra — 369 +2 БЕРТІНДЕ Б 26488 — 891+ (1с). E. 
1 


f In the original Pearson notation, & = Pr ба = Pa 
4-2 
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s $ 5 5 r 
The foregoing results follow from (8) when we substitute the values given in (5) and а m 
b, and b, and simultaneously substitute for Јо) and f(d) the ratio of standardized ordin 
to area Ж га 
Ло) = Ф6)(ро) and Ја) = 4(&")|(po), where p = и P(t) dt. 
Tf the values given in (10) and (11) for F, and F and those in 


(5) for h, b, and by are sub- 
stituted into (7), the resulting system of equations becomes 


ту = o[Z,—Z,+A]](1—2by) 


= 001, 
То = (А+), +0 = (4/0) 4/0 —3b,) = ag, (13) 
ma = ОЦА + 2) go + 20g, — (d/o)? 2,1101 4b,) = 0%, 


Ma = EA +34) дз +309, — (do)? 201 


50) = 0494, 


ға 
Where, 92,9, and g, are functions ой, с, cand c, as defined above. It must be remember 
that 0, A and у as defined in (6) 


are functions of 27, & and a, but not of с. 

When observed sample moments are equ 

i.e. vy = ту, where vy is the kth observed m 
n 

terminus (2 = Es In 
= 


estimating equations 


d ts 
ated to corresponding population me 
oment of the truncated sample about its 


of 
‚ and these values are substituted into (13), we obtain the system 


Vy mong 


4 
HAE", ст, ож ож) (= 1,9, 3, 4). И 
The stars (*) were added to 


distinguish estim 
estimates, £^*. у 


; for 
ates from parameters, After solving (14) 
„ож and of, the mean ¢ 


an be estimated with the aid of (3) as 


ИД“ = — оз”, (18) 


с | ght be obtained by using the writer's metho 
gher moments (1951) or регћа; ч 
ps even by judicious gues р a 
а the usual moment, estimates co i on кар 
suffice 


Singly-truncated : гез 
Ju are functions of been only а single tail’ is missin ~ ош 
equations of (14) are їп this мац . Accordingly, by eliminating с between them, th 4 m 
their solution is thereby greatly iat eed to three in number. The labour involV 


d 
ап 
g from the sample, 1» 92:93 


4. 
Type 111 distributions. 
assume the more simple 


Ту 
PE Ш Anp NORMAL DISTRIBUTIONS 
When Ж 


; 6) 
2) is of а ( 
forms ) is of type ІП, then b; = 0, and equations (5) ай 


h--gg 

> А=-Е 
bo = от, 12), ө : (10 
1 = 00,9, | i 1+ 692, 
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á Subsequently, it follows from the above results together with equations (13) that for 
oubly-truncated type 111 samples 


оу = [21-23-81], 
в = [003/2 ') g1 + (1 + 803/2) — (d/o) 2.1, (17) 
да = [(®—#')0»+ 21 82/2) gy — (0/0) 21. 


ч 
m 


Similarly, it follows that 


Ж, = [H(p] + 59/2], | т 
2, = (ЕВА +92) + (4/0) (/2)].) ~ 


Bus only the three estimates Z'*, o* and аў are required in this case, it is necessary only 
07 = 1,2 and 3 in (14) and then solve the three resulting equations. 

S d singly-truncated type III samples, the first three equations of (14) immediately 
io О two when с is eliminated between them. This case has been considered separately 
the а), and no further comment seems necessary here except to note that the function Z of 

>. aan paper was given as 20 in the earlier work. 
rie normal distributions. When Л (x) is the normal distribution, b, = b, = 0, and the 
of ie ABB equations of (14) reduce to identical forms obtained previously using the method 
aximum likelihood for both singly- and doubly-truncated samples (1949, 19506). 


5, A PRACTICAL APPLICATION 


i Practical application of results of this paper can be conveniently demonstrated with 
heey, example previously employed to illustrate the method of higher moments. The 
( iem data consist of weights of 1000 female students (cf. Table 1) as given by Miss Shook 
the )- Miss Shook computed estimates of population parameters from this sample under 
s kar api that it was a complete sample from a type ш population. For the present 
Ж. ration, a doubly-truncated sample was obtained from Miss Shook's data by arbitrary 
Neation at 79-95 and 159-951b. respectively. Thereby the first and the last six cells of the 
Stouped data were eliminated, and the truncated sample thus formed consists of 981 
Servations within the interval bounded by the above terminal points. 
Ж first three sample moments about the left truncation point both before and after 
‘Ng appropriate corrections for grouping errors are given below: 
Corrected moments 
= 37833843 = _ 37888848 
ру = 1651-73293 


na= 1660-06626 
n, = 81,227:1916 80,281-3455 


Uncorrected moments 
Vs 


d only when high contact conditions at the terminal points 
Those employed here are due to Craig (1937), which he 
s for discrete data, derived under conditions which do 
fication of their use in the present instance, note that 
be viewed as consisting of discrete measurements, 
h grouping accomplished by placing sequences of 
ve 101b. intervals. When final 


т, ш corrections which hol 
ane are not applicable. 
Not а, es as Sheppard’s correction: 
Miss on high contact. In justi 
вас 100128 original raw data may 
lon Ccorded to the nearest 1010. wit 
Consecutive values of these discrete values into successi 
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і і to be 
graduations are compared with observed data, the foregoing contentions seem 
further substantiated. | ПРИ 
Since our sample is doubly truncated from а type III population, the requir ed еа 
can be computed from the first three equations of (14). In the present aa ы. 
it seems preferable as a matter of practical convenience to turn back to the fi 


; tu hseauently 
equations of (7) with b, = 0. From these we compute estimates of h, by and бу. Subsequ 
we calculate estimates of #', с апа a, using the relations 


с = (bo tbh), аз = 2bo and £ = - Цо, 
which follow from equation (16). 


Р Р imating 
On equating observed sample moments to corresponding population moments, esti 
equations for the truncated sample under consideration become 


h* 


37:883843 — F* + [*, 
00 +37-833843b* = 1651-73293 + 80F* — 37-833843h*, 
75-6676866% + 3303-465866* 


= 80281-3455 + 6400F + — 1651-73293h*. 
Starting with first approximations & = — 2-5 
calculated as usual moment estimates, neglectin 


truncated, we employ the iterative method deser 
these results, After seven iterations 


" ich wer? 
, 0, = 15 and оу, = 0:35, which 


; al 
g the fact that our sample is ие. 
ibed by Scarborough (1930) to impr? 


: ур whic? 
Т we obtain #'% = — 2:43, 0% = 15-85 and аў = 0:61 W p 


0 
increased somewhat as a опао и 
n to æ, (0-35). When calculations 4 
б, 7, = 16 and & = — 2.42 which 10 
meters, the iterative process le 
before, after only two iterations instead of seven. with 
в obtained here was performed 1 


along with Miss Shook’s original 8°" 160 
5 computed from the compl 


: ete sam i in Table L s 
included in this table is a graduation taken from the dnd ы т which к 
та. on estimates from the same doubly-truncated sample considered ей 
Мом я зей frequencies based on estimates computed from the trae ug 
sample moment, eniti ie TE t with observed frequencies than those based on CO? st 
estimates are unduly affected i risa ud to the fact that complete sample (ye 
truncated sample, A i ee 
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using the higher order moments in their calculation. Either set of estimates, for the present 
illustration, would certainly be considered adequate as first approximations on which to 
base iterations to maximum-likelihood estimates. This topic is one which the writer hopes 
to investigate more fully in a subsequent paper. 


Table 1. Weights of 1000 female students 


Graduated frequencies 
Doubly truncated sample 
Weight Observed Complete 
(Ib.) frequency sample қ Graduation 
B (Miss Shook's bon ro: based on 
graduation) hi on estimates of 
higher sample 3 
this paper 
moments (first (first three 
five moments) omental 
70-79-9 9 0 0:2 0-2 
80-89-09 16 4 12-7 13-8 
90-99-9 82 102 94-0 93-2 
100-109-9 931 238 2140 212-1 
110-119-9 248 250 253-0 255-0 
120-129-9 196 184 200-8 202.9 
130-139-9 122 11 120.6 121.2 
140–149-9 63 59 59-4 58-5 
150-159-9 23 29 952 24-1 
160-169-9 5 13 9:5 8.7 
170-179-9 7 6 3-3 2-9 
180-189-9 1 3 1:0 0-9 
190–199-9 2 1 0:3 0-2 
200-209-9 1 0 0-1 0-1 
210-219-9 1 о 0:0 0-0 
Total 1000 1000 995-0 993-6 
Total frequency 981 977 980-6 980-8 
іп truncated у 
тапре 
M* (Ib.) Е 118-74 118-56 118-43 
o* (Ib.) ыз 16-9175 16-02 15-85 
ed - 0-976424 0-657 0-61 
Lower limit (Ib.) — 84-09 69:77 66-46 


‘Truncated sample formed by truncating the complete sample on the left at 79:95 Ib., and on the 


“ight at 159.95 1b. 
. А sample of the calculations involved in one cycle of the iterative process will serve to 
lustrat details of the technique. At the end of two cycles, we obtain as third approxima- 
1918 (a4) = 0.50, с, = 15-5 and & = — 2.44. Since &" = £' +d/o, and since for the example 
3 3 ; 
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under consideration, d = 80, it follows that 2% = — 2-44 + 80/15-5 = 2-72. From Salvosa’s 
tables, we read 


272 
$(—2-44) = 0-005050, (2-72) = 0-017913 and р - | A(t) dt = 0-989868. 
-944 


On substituting the above values in equations ( 18) we compute: 


0-005050 
аа — 2-44) (0-5)/2] = 0-00199, 
E Dol HERRERA] 


1 = 


0-017913 МН — 
and Z= pu [1+ (— 2-44) (0:5)/2 + (80/15-5) (0-5/2)] = 0-0304. 


Finally, from equation (10) we have as third approximations, 


Е = 15-5 (0-00199) = 0-03 and Р = 15.5 (0-0304) = 0-47. 


These values are rounded off to 0 and 0-5 respectively, and the estimating equations for the 
present cycle of calculations then become 


h= 38-334, 
bo + 37-8346, = 241-416, 
by +43-658b, = 266-482. 


On solving for b, and б, we obtain b, = 4-304 and b, = 78:577. Subsequently we calculate 
fourth approximations 


Ta = {78-577 + (4-304) (38-334)) = 15-6, 
(ад), = 2(4-304)/15-6 = 0-55, 
& = —38-334/15-6 = — 9.43. 
Using these values as new (fourth 


) approximations, we re 
the desired accuracy is achieved. 


peat the process as required until 
As stated previously, seven iterations were necessary to 
compute the final estimates give 


| n for the present example. In the subsequent iterations, 
more decimals were carried and the rounding off of F, and F was not quite so drastic as in the 
cycle illustrated above, 


tables of И and 


Pearson distributions, 


т) and can be a; i 
described by Cramér (1946), i = i 
of this paper remains 


& 


A. C. CoHEN 57 


REFERENCES 


Сонкх, A. C. Jn. (1949). On estimating the mean and standard deviation of truncated normal dis- 
tributions. J. Amer. Statist. Ass. 44, 518. | 

Сонкх, А. С. Jr. (1950а). Estimating parameters of Pearson type ПТ populations from truncated 
samples. J. Amer. Statist. Ass. 45, 411. 

Conen, А. C. Jn. (19506). Estimating the mean and variance of normal populations from singly 
truncated and doubly truncated samples. Ann. Math. Statist. 21, 557. 


Conen, А. C. Jn. (1951). Estimation of parameters in truncated Pearson frequency distributions. 
Ann. Math. Statist. 22, 256. 
Cnara, С. C. (1937). Sheppard’s corrections for a discrete variable. Ann. Math. Statist. 7, 55. 
Свамев, Н. (1946). Mathematical Methods of Statistics. Princeton University Press. 
Ківнен, В. A. (1931). Properties and applications of Hh functions. Introduction to Mathematical 
Tables, 1, xxvi-xxxv. Brit. Ass. Adv. Sci. 
Ler, Атлсь (1915). Table of Gaussian ‘tail’ functio: 
10, 208. 
Pearson, KARL & LEE, 


tion. Biometrika, 6, 59. н j 
Siivoa ni. iis Tables of Pearson's type III function. Ann. Math. Statist. 1, appended. 
ScARROROUGE J. B. (1930). Numerical Analysis, рр. 191-3. Johns Hopkins Press, Baltimore, Md. 
Smoox, B. L. (1930). Synopsis of elementary mathematical statistics. Ann. Math. Statist. 1, 14. 

; B. L. Я 


ns when the ‘tail’ is larger than the body. Biometrika, 


ALICE (1908). On the generalized probable error in multiple normal correla- 


[ 58 ] 


A PROBLEM OF INTERFERENCE BETWEEN TWO QUEUES 


By J. С. TANNER 
Road Research Laboratory, Department of Scientific and Industrial Research 


1. INTRODUCTION 
In most of the queueing situations which are amenable to mathematical analysis, the 
queueing units (customers in a shop, etc.) are all tr 


‘ying to achieve the same object, for 
example, in the case of a shop, to be served. Usually, the only differences between successive 


customers are their times of arrival, and perhaps the time they take to be served. Congestion 
‘then arises because only a limited number of customers can be served at a time. A feature 
of many road-traffie queueing situations, however, is that there are a number of different 
‘populations’ of ‘customers’ (in this case vehicles), and congestion arises not so much 
because too many vehicles want to do the same thing at the same time, but because a number 
of vehicles want to do different things which cannot all be done simultaneously. For 
example, at a cross-roads, less congestion is usually caused by a number of vehicles pro- 


ceeding in the same direction on the same road than by a smaller number of vehicles, half 
on one road and half on another. 


2. STATEMENT OF THE PROBLEM 

The physical situation associated with the statement and analysis of the problem has been 
chosen as one which is fairly easy to visualize. It is not necessarily the most useful applica- 
tion of the results. кай 
We have а road of which a certain len, 
and outside which there is free flow in 
at А at random at an average ra 
vehicles V, arrive at B at тапа 


until both these conditions are satisfied. 
there is no V, in AB and no other V; hase 


times are negligible, though these can 
constants o, and o. 


nb 
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The problem is to find the average delay W, to vehicles V, and W, to vehicles V, when a 
stationary solution exists, i.e. when the rate at which vehicles can pass through АВ is greater 
than the rate at which, in the long run, they will arrive, so that queues of vehicles will always 
clear themselves, given sufficient time. Other aspects of the process might be of interest, 
for example, the frequency distribution of the waiting times, but the average delays are 
considered to be more important and are easier to find. 

There are three distinct cases which should be considered, according to the relative 
magnitudes of the огв and f’s: 

6) ef %> в», 
Gi) «>, 9а<Йа (oro < В, а> Po), 
(іі) o, «fi. <. 

Cases (ii) and (iii) appear to have fewer applications and present mathematical difficulties 
of rather greater complexity than case (i). We therefore confine attention to the latter case. 

The physical situation associated with case (i) is as follows: If there is a V; in AB, then 
V/s will not have an opportunity to enter AB until there are по V,’s in AB or waiting at A, 
since there will, until then, always be at least one V, in AB. Similar considerations apply 
for the other stréam, and we may say that at any moment either V,’s or ўв or no vehicles 
have control of AB, according to whether it is occupied by Vj's, V;'s or is empty. In cases (ii) 
and (iii), a queue of vehicles may not be able to clear in a single ‘ block ^; vehicles in the other 
stream may be able to interrupt the flow, even though the vehicles in the first are at their 


minimum separation. 


The concept of regeneration points (Kendall, 1951, p. 184) gives an indication of why 


cases (ii) and (iii) appear to be less amenable to mathematical treatment. In case (i) there 
are various convenient sets of regeneration points; for example, the beginnings of all 
blocks of V;’s. In case (ii), the only comparable set appears to be the beginnings of all 
blocks of И”, while in case (iii) there are no comparable sets. The method to be used for 
case (i) makes use of two sets of regeneration points and so is inapplicable to cases (ii) 
and (iii). Of course, any defined set of instants forms a set of regeneration points in any 
of the three cases if the precise position of every vehicle in the system is included in the 
available description of the state of the system at any time, but such a set would not be 


of practical use. 


3. ANALYSIS OF CASE (i) (2 > Њу, Oy > дз) 


үү. t the analysis in the most general form, though the equations which arise have 
e carry ou 


= 0 or fa = 0. 
какша кшй Н AB is occupied by either a V, or a № or is empty. Periods 
any time, 


ibilities hold will thus alternate in a manner which must be deter- 


when ipo m ue mean delays in terms of the distributions of the lengths of these 
mined. We then 


periods. 7 
3-1. Notation and definitions 
eriod when AB is controlled by У; 5, whichis immediately preceded 


A VB, (У block) is a P t the case. V B,'s and V B's are similarly defined, the. 


and followed by periods when this is E 
latter being periods when AB is empty. 
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fo; t and t, are lengths of VBy's, VB,’s and VB,’s. Each t; (i = 0,1, 2) has a distribution 
whose moment generating function (m.g.f.) (607) with respect to an argument T is МАТ). 

тү and r, are the numbers of И’ and И’ waiting at the beginning of VB,’s and V By's 
respectively. 

t |r,, with m.g.f. 14(7'|7,), is the length of a V B, conditional upon the number of V/'s 
being т; at its beginning. f; | 7, is defined similarly. 

Фо Фу and ¢, are the relative frequencies of VBy's, VB,’s and Вз, with goto, +ф = 1. 


qi; (tj) is the probability that a VB; chosen at random was preceded by a ИВ, 
Other symbols: 


М(—») = №, MM —q,) = М, 


9; = d, d — d; М, ô> = qd1-- da =q Mh, 


у= M, + M,- M, My, 


& = [6-4 — 1 ја, в = [609% — 10, 


v is a function of 7, the smaller real root of 


у = efto 9T. 
Similarly Vg = fils D aT. 


Рі- Еф, P= воду. 


It may be shown that the condition for a stationary solution to the problem to exist is 


Pit p» € 1, and it is assumed throughout that when this is satisfied, all the series which arise 
converge and the m.g.f.’s of all the distributions exist. Proofs of these assumptions could be 
given, but they are thought to be sufficiently intuitive, 


3:2. Outline of method 

Stage 1: Find the 4) and the ф, in terms of М, and М, (and the basic constants). 
Stage 2: Find the distributions of 7; and 7, also in terms of M, and М, 

block immediately preceding, ` » 


Stage 3: Find the distributi 
the distributions of these jus 
The assumption that the lead: i 
t 
МАТ) and МТ) which conta; i mae! сыйы Шы 


by considering the 


ons of t, and fg, 


first for fixed 7, and 7, 
$ obtained, 


and then averaged over 


arrived during the interval 14, Le. it is e7424, 


is the chance that no V, 
Th Кро 0 hat по V, 
of t. Hence 98 Pio = Ele 1^). taken over the distribution 
Pio = №. 
калын | Poo = М, Фу = 1 =M, Ра = 1 =, 
Also = „у q 
Por dE жа) Ро = -— - 


=u 
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It is easily seen that Фо = роф + P20 $2 = Yor Фз + оф» 
91 = Раф. Por Фо = таф + oho 
‚ Фа = Ро»фо + Ру» = Фофо + 91 фа, 
and these equations, together with qoi + 451 = 1, ete., give 
| фо = Wt )[[91 + ёз + WG +93), 


фл = 459, +8. +7(9: +92)]; (1) 
фу = Ój/[0, - 0 -- Y (01 + @)], 

doi = 917/63, Фа = (1—25) 81/6», 

doz = 127/515 Фа = (1—21) 0/5), (2) 


Фо = М,бь/{У(ау +4) Фо = Me DIV +92}. 


3:4. Stage 2 

| We consider the distribution of r,. The chance that a random УВ, is preceded by a V B, 
is о, and in this case 7, = 1. Consider now а random VB, of length 6. The chance that just 
r Vis arrive during this V B, is 

e-th(g,ty|r! (r = 0,1,2,...). 
Averaged over all V B,'s this is В[е“ч(ађу југ, 
where the expectation is taken over the distribution of tą. Hence, considering only those 
V By's which are followed by VB,’s, the probability of just 7 is 


ео, ҺР – езі) (r= 1,2,3,...). 


The m.g.f. of this probability distribution is 
МАТ) = Ele *^ Y, (бегу ИТ М) 
= е-ацеше"- 1))(1- М) 
= [Mylqy(e” — 1)) — Mj — №). 


We thus have the m.g.f. of the distribution of 7; as 
0016" + а Ма (27 — 1))— M/A- My) = Gye" [ôs + 6 | ,(0,(67-1))-.М,0, (8) 


and similarly that of 7; as 
qs ye? [8; + Sal Th (qs(e7 — 1)) — M; ]/8,. (4) 


3:5. Stage З 


We require the following lemma: 

Lemma. We һауе a queue in which the units arrive at random, at arate 4 per unit time, 
each unit taking a time 2 to be served. Initially there are 7 units in the queue, and the first 
one starts to be served. Then the m.g.f. of пр, where n is the number of units which will be 
me there are none being served, is v” where vis the smaller real root of 


у = ебфФ—Ю+/Т, (5) 


served before the next ti 


When  — 1, the problem is that of finding the distribution of the number of individuals 
gle ancestor, to use population mathematics terminology, 


in all generations arising from a sin, | 
and has been studied by several writers (e.g. Harris, 1948; Good, 1948; Kendall, 1951). 
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A proof will not be given here. The result for a general value of r follows immediately since 
n can be regarded as the sum of r independent components, the descendants of the initial 
units. 

Borel (1942) has given the frequency distribution of n for the case r = 1. It is of interest 
to note, although we do not need it here, that his method applies almost unchanged for a 
general value of r, the probability p, of n individuals being 


Py = "(В)" ran (mr) (no rr ...). (6) 


The m.g.f. of nf can quite easily be found from the above expression for р. 
We now apply this lemma to prove 


| уп ea, (7-23 (а. — T 
(i) |) = —— т 


а —Т— уд, + 019, 61-200)? (7) 
. y'a dta B3 (T9 (q, — Т) 
(а) MT |r) ds — T — vaqa + va qu d PT" (8) 


We consider (1). 1, | 7 is the time that elapses from the initial state in which 7, V,’s are waiting 
till there are no V,’s in AB. This time can be split up into the following components: 

(а) an ‘ri wave’;* 

(6) an equal number, A, of unit waves, and gaps whose distributions are negative- 
exponential between 0 and од — f; Ais a random variable; 

(c) a final time a, — f. 

These components are distributed independently of each other, so we determine tho 
m.g.f.’s of each and take their product to give M, |r). 

For (а), the m.g.f. is и", by the lemma. 

The gaps іп (5) have the distribution 


а- р: 
aca nendi (0t o, — f), 


| Каву) Ti 
whence their m.g.f. is em сезш 
Ф = Т в«а-да-17 


The ‘waves’ іп (5) have m.g.f. vı, by the lemma. The probability P(A) that we have just A 


gaps and waves is the probability that the first A gaps following waves are all less than 
a, — Ву and that the next one is greater thana,—f,.Thus | 


P(A) -П- e Ca- Роа e л-р), 


+ =h 
since Г Феі = 1—e-0-5)a (%=0 кеј 


The lengths of the gaps and waves are all та 
nins dede bie all independent of each other and of the number of 


e 
> [1 еее 592; | Ф _е©-#0(Т-а)_ ii 
A=0 


ES eel ae е—@1—в1) а 
Ф-Т ema 1 m 4 2 
lpn = fda дут) 
Hence we finally have "а-Т ma 


MT |n) = ALAT (а, _ т, 
94—T у + 1962—2007) 
(7 | т) may be derived likewise, 
* This is defined as the np 
no Vys waiting. 


The expression for М, 


of the 1 i 
emma, and is f? greater than the time that elapses until there are 
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We next average M,(T | r,) over the distribution of 7; given by (3). We have 
МТ) = ET |т)) 
саз“ Ву –а) (т, — Т) 
-а-Т-ла y, g, e FOP 


eer) (di же; Т) x: [5r log v; бү Лоби; 
q1- T diit di y, dba) да Vien: б, sae -1)- m| à 


САБЫ) 


Hence, finally, 
ба – T — dvi + а-#юї-а)] МТ) 
= = ба-а (q, — Т) [У 9: (ад (5 — 1)) – М. (9) 
Similarly, 
Silo — T — qa va + qa v, cea 0207792] МАТ) 
= аа (д, — Т) [qr yv 8s (M(g,(41)) М). (10) 
The solution of these equations in the unknown functions M,(T) and МТ) to give the 
moments of the distributions of t, and t has not been achieved in the general case. The 


solution has, however, been obtained when one of the f’s is zero, and this is considered 


later. We note, however, that in the general case, expansion of (9) and (10) leads to 
(1 — 04 — 93) дыма (в) = 26101 — pa) +9, €2P1+P1Y | 


, (11) 
and (1— р: — Pe) б/а (5) = 956; pa + 6162(1 — P1) арау.) 


3-6. Stage 4 
We now derive the mean delay to a V; by considering the average number of V/'s waiting 
at a random instant. Suppose that the average number waiting during à VB, is m, and 
during a VB, is ms. Then the average number waiting at a random instant is 


— ms фу на () + ma фә ts (45) (12) 
Pol (m + Ge) + Фу (А) + фора (65) 
since фо(а, + qs), -.. are the relative amounts of time in V By's, .... The mean delay, which 
we denote by 201, is then 
®, = ^ (13) 
1 


ge number of И’ waiting will be 4414, since the expected 


In a VB, of length t, the avera, 
f the block is 4,4. Hence, averaging over the dis- 


number at a time t after the beginning о 
tribution of t, we obtain 4 
Е(1ау%.%) _ За) ) 


то =“ Вы) Mlt) 


It thus remains to find m;. 
We find m, first of all in te 

as previously defined. Consider a VB, 

the block into a number of parts, durin 


immediately, as follows: 
(i) During the r-wave, expected total delay = y(r). 


(ii) During the unit waves, expected total delay = (62-20% — 1) и). 
(iii) During the gaps. expected total delay = 0. 
(iv) During the final æ — Ва, expected total delay = 0. 


rms of y(r), the expected total delay to V,’s during an 7-wave, 
which starts off with an 7-wave. As before, we can split 
g which the expected total delay can be written down 
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Hence, adding together for the whole block, 


expected delay total = y(r) + (« а“ Рот — 1) Џ (1). (14) 
Averaging over the distribution of r, we obtain 


expected total delay = y(r) + (« а“ Рот — 1) Џ (1). 


Also, average length of block = и1(ђ). 


my is now obtained as the ratio of the last two expressions. 

To find у(г), we break up the r-wave into ‘generations’, as follows (the procedure is that 
followed by Borel, 1942): While the initial 7 vehicles are departing, suppose a further а 
arrive; and that while the a are departing, a further 6 arrive; and so on. The number of such 
generations can be indefinitely large, and the chance that the sizes of successive ones are 
7, а,б, c, ... is evidently 


Ба, а ЕЕ АЕ 


а! ы с! 
e-nrgatb ... уаде „.. 
"— WHRL. — 9 we беди" (15) 
. (0? is to be taken as unity). 
In this case, the expected total delay is 


&fir(— 1) - a(a— 1) 3- b(b — 1) + ...]+ 3/4, [ra 4- ab + bc + 35s]: (16) 
The terms in the first bracket arise from delays to vehicles while vehicles in their own 
generation are departing, 


while those in the second arise from delays while the previous 
generation is departing. 


The detailed algebra required to obtain the expectation of the terms in (16), averaged 
over the distribution given by (15 


by ), will be omitted, but may he obtained by noting, 
for example, that the probability of a particular a, 5, с, 4 and е, followed by unspecified 
frequencies, is есю фа+б+е+4 piyaabbeedqel(io 1 с! dl e! ). 
We find that, for example, 


Ш 


E(e(e— 1)) = 7(р8 +... + p3) +7201 


E(de) 7(р5+ ... +08) +7292. 
We finally obtain the expected total delay as 


ЕЛ ЕСЛИ 
He) = ну 


= p 
VQ = Mc, 


== 2p,-1 
y(r) = 1 А l g 
| y(r) mja – рају + 1-5,4n | . 
ternatively, it may be shown quite sim 1 
ply that the joint m.g.f. of a,b 
(еее...) аи 


exp (— p17 +10 exp (t, + 0 ex 

d P (tg +O exp (...)))). 
(This is closely related to expressions given by Good, 1948 ) Re ))) | | 
а power series in ¢,, t, ... then leads to the same value of vr) pansion of this function as 


and 


Thus 


is equal to 
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Hence, averaged over the distribution of r, the expected total delay to V,’s during a ГВ, is 
Зр1—1 М i «| (afd — a. 7 

М esum e rem | + tent nA n s (17) 


Expansion of the generating function (3) gives 
(е) = дуујбу + 9 ats (5/93 
and ` tlr) = ду ујда + дуд la (0) + Ч) д. 
Substituting these expressions in (17) and the expression in (11) for /41(4;) then gives the 


expected total delay as 


in ( d 

ee BAR... + 6, 03 + €201) + 91 (1 — ру — P2) из()]. 

=з ли е Ре 

Collecting our results together, and substituting in (12) and (13), we find that the mean 


delay to V,’s is 


1 ду alte) (1 Ae | 1 
- | 8 
н 2(1—р,) Шы Y o 69s + E201 (18) 


1 So kolt) (l= 01 — 02) 
„= ж +++ vated, T (19) 


ё| 


Similarly 
This completes stage 4. 


4. ExPLICIT SOLUTIONS FOR SOME SPECIAL CASES 
As already mentioned, the analysis of $3 leads to equations for the mean delays whose 
solutions have not been obtained. If, however, we assume one of the f’s to be zero, an 
explicit solution can be found, and we now consider this case. 
4-1. Special case в» = 0 


In this case, equation (10) reduces to 


а-Т р 
MT) = dg — Tet” (20) 
91 +92 
80 that M, = И(–%) = кемане" ОГ) 


Substituting їп (9), and putting T = — ду, leads on simplification to 


d1 de 
м (22) 
а at exo (а +) ду — (0 — Т) ша ШӨ 

1 


1 + (0 — 1) ето 
where w is the smaller real root of — 
Expansion of equations (9) and (20) leads to 
ва — Pa)? Halts) = 2601 — Pa) qx (1 + €292) (6; — 9%) 
+ 28,94(1 Ра)? (1 +в! 91) (в, — а + Ai) (23) 
+ дара (у 0162 + 8,61) (р1 + 23161 — 2аувуру) = Р, 
Golta(te) = 2(1 + €292) (6; — 2) 
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Hence, by (18) and (19), 


à. = рі 94(€292 + 1) (в — и») 
+ 24(1-р) | Феу +61, + €301) 
== pi ду e72%2(e%2% — oqo — 1) - (24) 
2411-рі) GY +6060) ” 
1 p 


and 


0, = XU - p у чей where P is given by (23). (25) 

These expressions for W, and %, are explicit formulae, since the only quantities they 
contain, apart from the basic constants, are у, д, and 0,, which can be expressed in terms of 
these by means of the relation (21) and (22). However, they are obviously unsuitable for 
tabulation and it is not thought that any important simplification can be made to them. 
This special case will therefore be left in this rather unsatisfactory state. 


4-2. Special case A= В. = 0 
From (24) and (25) we have 


= _ д 16703 (25212 — oi, Jn — 1) 

GY + 618, + €201) 

ds ни 
dily + € 5g 4-650) 


Using (21) and (22) we find 


(41--4 2) (gett) + саға) 
y d» 7 
(qo ға exatay) (41 +9 eaa) 5 


8, = (41 +92) aerated _ (9, +95) даје“ т+т) 
еки 1 META eaa) ^ = ath сока) › 
- 6 (0, +9 eaa) ers Фф - 1 
BENT — еза + 1) + 4, 6728 (een s — parte + 1) da = (26) 
and M бат (ду + qs etie) ei — 0—1 Е 
фу "eth (elarta) 2 — eara 4 1) + ҮІ СЕЗЕТІЛЕНГІГІ Е 1) di о“ (27) 


Table 1 below tabulates 27 W, for the case a, = a, = 1, for Ч; = 0-0 (0:5) 2-0,4, = 0-0 (0-5) 2-0. 


Table 1. Values of ©, for f, = By = 0, од 


-оОә-1 
91 
0-0 0-5 10 15 2-0 
0- H 
os | m | mom | ош | ош 
ГА 2 915 0-61 б 14 er 
i es 11 0-97 0-84 020 
1-87 1-67 1-47 1-27 
ч 


a 


І 
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4-3. Special case a, = f, = 0 


In this case we have М: =1, 4,295 €4—0; yak 
Hence by (24) and (25), 


a= рі ; 
17 20,17 p)" (з) 
1 
05 = — — qof 2p, — 8p? + 20% + 24161 (1 — py) – 29, 04(1 — 01? 
2 20 - pg o pi — 3pi 2p} 2916 (1 — py) – 20101 (1 — 37] 
eam 1 - p + 2p} 


(29) 


пр) ^ q (01-Р) 
This case corresponds to a physical situation in which vehicles in stream 2 do not affect 
those in stream 1 in any way. % is therefore the mean delay for a simple queue with arrivals 
qı per unit time and service time f. 
Table 2 below tabulates 10, for c = 1, Ё, = 0:0(0-1) 1-0, gı = 0-0 (0-2) 2-0, for values 
of 10, less than 10. 


Table 2. Values of ts for the special case & = Въ= 0, & = 1 


А, 

00 | 01 | 02 | оз | 04 | 05 | 06 | 07 | 08 | од | то 

oo | о-оо | 0:00 | 0-00 | 0:00 | 000 | 0:00 | 0-00 | 0:00 | 0-00 | 0-00 | 0-00 

03 | о | 011 | O11 | 0-11 | 0411 | 012 | 012 | 0-13 | 013 | 014 | 0-16 

04 | 0.23 | 0:23 | 0-23 | 0-24 | 025 | 097 | 0-30 | 034 | 0-38 | 046 | 0-56 

06 | 0.37 | 037 | 038 | 040 | 043 | 049 | 0-57 | 070 | 092 | 1-27 | 1:88 

a | ов | 0:53 | 0-54 | 056 | 0-60 | 0-67 | 0-83 | 1-04 | 137 | 2-37 | 4-40 | 10-00 
1.0 | 072 | 0.73 | 0-76 | 0-84 | 099 | 1-30 | 1-06 | 3-59 | 891 | — * 
1.2 | 0.93 | 0-95 | 101 | 115 | 144 | 215 | 419 | — | — * * 
1.4 | 118 | 121 | 130 | 154 | 212 | 386 | — = * * * 
1.6 | 1-47 | 151 | 1:66 | 205 | 319 | 833 | — * * * * 
1-8 | 1:80 | 1:86 | 208 | 273 | 510 | — * * * * * 
20 | 220 | 227 | 260 | 356 | 920 | * * * * * * 


* p,>1; no stationary solution exists. 

In this special case, it is possible to obtain an expression for the m.g.f. 24, (Т) of the dis- 
tribution of the delay 20. We proceed as follows: 

If a V, arrives in a VB, of length ђ, its delay will have a rectangular distribution in the 
interval (0,1). The m.g.f. of this delay will therefore be 

(eh? —1)/t, T. 
Averaging over the distribution of 4 gives 
8007 — 1) ТЕЬ] = (28027) – ШИТЬ). (30) 
is always unity, we have 
реба) (7-4) (у — T) 

MT) а di —Т— уд, + руде Pv Pa) » 


Now, by (7), and since 7 


еќа1-й) 1 1 
| "E ИЗ (31) 
So that p) ni-m] а 
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If a V, arrives in a V Bg, it will have zero delay. The probability of this is 
19 1—pi 


балала жа gn: (32) 
Hilt) + Па, deco 
since V B,’s and V B,'s are equally frequent. 
From (30), (31) and (32) we obtain 
м.т) (1-р) (— T) (vm —0 - T) (33) 


= Тева уду q,— T 4- vq 000-0 ү" 


4-4. The special case f, = а, = В = 0 
The solution for this case can be derived quite easily from the last section. We obtain 
(а — T) етт 
еа — Te-aT › 
uu (34) 
and Ww, = 0, 


МТ )= 


W, = (eh — оду — 1)/q. 
This case has been considered in more detail by Garwood (1940), Raff (1951) апа Гаппег 
(1951). 
5. APPLICATIONS 


In this section we consider briefly some particular applications of the preceding analysis. 


5-1. Delays to pedestrians crossing a road 


Suppose a stream of vehicles, flow q, per unit time, is passing a bottleneck where they are 
constrained to pass in a single line, with minimum separation £}. At this bottleneck, pedes- 
trians, who demand a gap «, between themselves and the next vehicle to arrive, are crossing 
the road. Then their average delay is given by (29) and Table 2. This can be seen by noting 
that both for the situation just stated and for the solution of the general problem with 
947 [47 0, & = д, = 0, vehicles in stream 1 are unaffected by those in stream 2, while 
vehicles (or pedestrians) in stream 2 wait for a gap а; in stream 1 and do not hold it up. 
(It is assumed that the time taken by a pedestrian to cross the road is less th 
vehicles are not held up by pedestrians.) 
is given by (34). 


an од, so that 
When there is по bottleneck, i.e. В, = 0, the delay 


5-2. Delays to vehicles on minor road at uncontrolled intersections 

If the suffix 2 refers to vehicles using the minor road, 
have to cross or merge with just one main road stream, vi 
right of way, we have to consider the situation œj > 2, > 


reasoning holds as in the previous paragraph.) This is a specialization of case (iii), as defined 
earlier, and the solution has not been obtained. If, ho ; 


and wever, the minor road traffic flow is 
small, it will not affect the result much if we put 0, = 0, in which case the mean delay is 
given by (29), as for the pedestrian case above. d 


and we suppose that these vehicles 
ehicles in the latter having absolute 
0,2 = 0, 8, > 0. (The same type of 


5:3. Delays to vehicles at uncontrolled intersections where 


neither stream has absolute right of way 
If we consider just two streams of traffic i i 
ilie intersecting аф right angl i 
m ‹ gles, the math tical 
виа 4 age of case (i), од >рі>0, а, > > 0. The solution for this has not ү iae 
ut, as m the previous section, we can give the mean delays if 2, and В» (or just one of йн] 
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are negligible, or if one or both of the flows is very light. The delays are then given by (24) 
and (25), or by (26) and (27). 


5-4, Delays to intersecting streams of pedestrians 

There are a number of similar problems under this heading. 

If we have a staircase, passage, doorway, gangway, bridge, etc., on which one-way traffic 
is enforced, we have a problem mathematically identical with the vehicle delay problem in 
terms of which this paper has been formulated. The simplification 2; = fa = 0 may more 
often be appropriate here than in vehicle problems. 

There will in practice often be complications in these situations, such as multi-channel 
operation, non-random arrivals, and so forth. An interesting, though no doubt very difficult 
problem, which has its counterpart in the vehicle delay problems, is to find what happens 
if instead of having single-channel operation we have several channels which could be 
Operated some in one direction, some in the other, at the same time. Passing from one 
channel to another in the middle of the passage through it might or might not be permitted. 


This paper was prepared as part of the programme of the Road Research Board of the 
Department of Scientific and Industrial Research, and is published by permission of the 


Director of Road Research. 
The author wishes to thank Dr F. Garwo 


preparation. 


od and Mr D. G. Kendall for advice in its 
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72 Tables of the angular transformation 


Table 2. Angular transformation of percentages less than two 


100р | 000 001 002 003 004 005 006 007 008 009 |1 2 3 4 5 
0-0 | 0327 0960 1222 1423 1592 1741 Ж Р CU 

01 | 2327 2.425 2518 2608 2-694 2777 | | See note oa 
0-2 | 3156 3.226 3295 3361 3427 3491 6 19 19 236 32 
0:3 | 3-793 3850 3907 3,962 4017 4071 5 11 16 22 27 
04 | 4330 4380 4499 4477 4595 4573 5 10 14 19 24 
0-5 | 4803 4-848 4892 4935 4979 5:022 4 9 18 iv 22 
0-6 | 5231 5272 5312 5-352 5392 5432 4 8 12 16 20 
0-7 | 5-625 5663 5100 5-737 5774 5811 4 л п 16 18 
0-8 | 5-991 6027 6062 6097 6132 6166 |3 тло dà 11 
09 | 6336 6369 6403 6436 6469 6-501 з т 10 18 16 
10 | 6662 6694 6.725 6-757 6-788 6819 з 6 9 12 16 
ГІ | 6-973 7-003 7033 7063 7093 7-122 3 6 9 12 15 
12 | 7-269 7-298 7-327 7-356 7-384 7-413 з 6 9 n м 
L3 | 7554 7.582 7609 7637 17.665 7-692 3 5 ви M 
14 | T828 7-850 7-882 7-908 7-935 7-961 з а 58 BD B 
15 | 8003 8118 8144 8170 8196 8-222 3 d W 10 19 
16 | 8349 8274 8399 8494 8440 8474 2 в 7 10 12 
17 | $597 8:621 8646 8670 8691 8718 2 Б т 10 12 
18 | $838 8862 8885 8009 8933 8956 2 5 T7 9 12 
L9 | 9073 9-096 9119 9142 9165 9-188 2 т бё A 


Interpolate linearly between tabular values when 0-05< 100р < 0:2. When 100p < 0-06, use the formula, 
Habits +6-325 Үр. For values of 100p greater than 98, enter with 100(1— p) and subtract the tabular value 
rom . 


Table 3. Angular transformation of proper fractions 


*1/30 | 11-989 | *1/10 | 20-676 | 5/27 | 98469 | вә 5- 

1/29 | 19140 3/29 21-037 | 3/16 | 28-650 0/18 36486 
1/28 | 12352 | 2/19 | 21-225 | 4/21 | 28-801 | 7/25 | 35.599 
127 | 12-575 | 3/28 | 21-418 | 5/26 | 29-038 | 2/7 | 35.994 
1/26 | 12811 | пе | 21-820 | *1/5 | 29-651 | 7/94 | 36400 
125 | 13-062 | 3/26 | 22247 | 6/29 | 30192 | sn 36-5 
1/24 13:328 | 2/17 | 22-470 | 5/24 | 30-304 9/27 36731 
1/23 18-612 3/25 | 22-700 | 4/19 | 30-475 | *3/10 | 36-987 
1/22 "915 | 1/8 | 23182 | зла | 30-766 7/23 | 37-286 
| 14240 | 3/23 | 23-697 | 5/23 | 31.004 4/13 | 37-516 


9/20 46-832 
5/11 | 47-121 
11/24 | 47-362 
6/13 | 47.565 | 
13/28 | 47-739 ( 


#7115 47-890 
8/17 | 48:139 
9/19 | 48:335 
10/21 | 48-494 
1/20 | 14-590 тақ sss 
. "2/15 | 23-968 | оо | 31.373 2 . 
ЕЧ ЕЛЕН ЕЕ 
MIT | 16-821 | 1/7 | 24841 | «pao | 22019 | 8/19 | 38-068 
0 " 7 
116 | 16308 | «рт | 25:315 | а; зза alas 38.354 
*1/15 | 16-844 | 3/20 | оз. 
2/29 | 17-133 39 5319 | ЫШ | 32566 


12/25 | 48-735 
13/27 | 48-899 
14/29 | 48-909 
*1/2 | 50-000 


9/28 ; 
ун | ИЗ | gig | sears | Sas ЧЕЧЕ 
2121 17:760 | 495 | 96-354 | 1/4 23405 10/29 10-021 
| 18101 | *1/6 26-924 7/27 | 34-114 өле 40109 
2/25 | 18463 | 5/99 


27408 | 6/23 | за. 
105 | 18:848 | 4128 | 27.584 Bite Sd 1/20 | 40365 
БУРЕ | SE) 
2/ | 20169 | 211 | 25180 | 196 | 34830 “026 


Қ 9/25 г 
ШЕЛІ 
, For fractions exceedin 5 
м mene one-half, subtract the fraction from unity and the angle from ћ 
one hundred, 


м. 
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None of these advantages can be considered as overwhelming, but, since a new tabulation 
was to be undertaken, they were sufficient to lead to the choice of this form for the trans- 
formation. As an alternative, we also considered taking 0 as one-tenth of the expression 
above but the resultant table would look so much like a table of probits that it would not 
be at all difficult to enter one in mistake for the other. 

Table 1 gives 0 for p = 0 (0-1) 50 with proportional parts for linear interpolation. If 
100p > 50, enter the table with 100(1—p) and subtract the tabular value from 100. For 
example, if 100p = 62-77, we enter with 37-23 and finding that 0 = 41-833 obtain for the 
required transformed variable a value of 58-167. Table 2 gives a finer tabulation for the first 
2%, = 0(0:01) 2 95, and hence for the last 2% of the range, although we note that it 
will not usually be profitable to use the angular transformation when 7 is close to 0 or 100. 
Table 3, following the pattern of the original table in Fisher & Yates, gives the angles 
corresponding to proper fractions with denominators not exceeding 30. 

The writer hasalso calculated a tableof angular valuesfor adjustments of special accuracy, 
analogous to Table XIV given by Fisher & Yates (1938). This table is not reproduced here. 

The tables were computed with the aid of British Association Mathematical Tables, 


vol. 1 (1931). 
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TESTS OF SIGNIFICANCE IN A 2x2 CONTINGENCY TABLE: 
EXTENSION OF FINNEY’S TABLE 


Сомротер ву В. LATSCHA 


Institute of Actuarial Science, University of Berne 


EDITORIAL FOREWORD 
has given a table which may be used to test the significance of the deviation 
onality in any 2 x 2 contingency table having 
its margins less than or equal to 15. The table printed below 
table up to marginal frequencies of 20. As the interpretation 
exactly similar to those of the 1948 table, only a brief introd 

Using Finney’s notation, the contingency table should b 


Finney (1948) 


from proporti both the frequencies in one of 


extends the range of Finney’s 
and uses of the new table are 
uctory statement is required. * 
e arranged in the form 


Number of 
Т Total 
Successes Failures 
Series I a A-a A 
Series II b B-b B 
т=а+ь A+B-a-b A+B 
L 


is that which makes a/A >b/B. The table of significance 
levels 1s arranged in sections according to the value of 4 ; the sections for А = 9 (1)15 were 
given by Finney, while those for A 


ж АРУЫ 


Significant effect at that 


are available as a Biomet 
form, Finney’s table, 
Cans now at Press, 


| y А arie rika ‘separate’ a 
Biometrika Tables for Statisti 


nd the pr 2 р 
but not the extensi © present extension will 
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As an illustration, we may use Lange’s data on criminality among twin brothers ог 
sisters of criminals (Fisher, 1946, §21-01). This example was taken by Finney (1948), but 
as А > 15 he used it to show how his table could be extended under certain conditions. As 
A < 20, direct entry is now possible in Latscha's table. у 

The contingency table shows the number of twin brother or sisters of criminals who had 
also been convicted of crime, classing separately monozygotic and dizygotic (but like- 
sexed) twins: 


| Not convicted | Convicted | Total 
| an | 
Dizygotic | 15 (=а) 2 17 (= 4) 
Monozygotic 3 (= 0) 10 | 13 (= В) 
Total 18 12 | 30 


Following the rule given above, the letters A, B, a and are associated with the observed 
frequencies as shown. The null-hypothesis is that the twin of a criminal is no more likely 
to be convicted of crime if the twinning is monozygotic than if it is dizygotic. If the only 
deviation from the hypothesis which we are prepared to consider is that monozygotic 
twins behave more similarly than dizygotic, a single-tail test will be appropriate and we 
shall ask whether a/A = 15/17 is significantly greater than 6/8 = 3/13. 

Turning to the appropriate section of the table with A = 17, В = 13 and а = 15 we find 
that the observed value of b = 3 is significant at the 0-5 % level, since it is less than 4, the 
last entry in the row of bold figures. 

The figure in small type following b = 4 indicates that for a contingency table with 
Marginal frequencies 

A=17, В=13, r=a+b=154+4=19, 4-В-а-%-П, 


the conditional probability of an arrangement within the table with 0 < 4 is 0-002 on the 


null hypothesis of independence. The probability that b «3 within the observed table 


(having a+b = 18) is not recorded, but is < 0-002. 
As far as possible, checks on the internal consistency of the table have been made as 


Well as comparisons with the more extensive tables for the special case A = В published 


by Swaroop (1950). 
REFERENCES 
уме ‚ Biometrika, 35, 148. | | 
нду, т Bosse Methods for Research. Workers, 10th ed. Edinburgh: Oliver and Boyd. 
Swanoor, 8. (1950). Indian Med. Res. Mem., no. 35. 


Significance tests in a 2 x 2 contingency table 


— 
Probability 
Probability 
a | 
| 0-005 
a 0:05 | 0025 0-01 
0:05 | 0025 | 001 | 0-005 
002 
= 94 | 8:04 | 7-0% | 6 
E: Eun open. ee | м | Reis БЕДА Бра 6 013 | 5 04 | 5 +004 
A=16 B=16| 16 15 | 7 056 0057 
15 |10 o1 | 9-019 | 8 ов 7 14 | бою | 5 o1s~| 4-иө-| 4 005 
14| 8.07 | 7 012 | 6 00 : Rs 13| 5-039 | 4-014 | 3 00 : in 
13 | 7.033 6 :015- | 5 -006 4 12 | 4 ом 3 012 2 -003 1 ste 
12 | 6-037 | 5-016 | 4-0% Es 11 | 3.027 | 2-0 | 2 008 EC 
11 | 5 038 | 4-016 | 3 -006 002 10 | 2-019 | 2-019 | 1 -oos-| 1 i 
10 | 4 -037 3 -015- | 2 .005- | 2 -005 9 | 2 -040 1 01 0:02 | 0 0 А 
9 | 3 өз 2 -012 1 -003 1 -003 a| Та 1024 0 -004 0 00 
8| 2-027 | 1-008 | 1-008 | 0 001 7| 1.048 | 0 .010-| 0 -o10-| — 
7| 1.09 | 1-019 | 0 -003 | 0 0 &| 0-4 | 0 ar |— — 
6| 1.041 о 009 | 099 | — 5| 0-044 | — — = | 
5 | 0022 22 | — => 
15| 16 |11 ов |10 ов | 9 -007 | 8 ог “е | вв | 3-007 
15 | 9.033 | 8 -o14 | 7 005*| 6 -002 п Е 1 bit bos 5.09 | 49? | 
14 | 8 044 | 7 019 | 6 008 | 5-003 14| 5 027 | 4 -009 | 409 | 3 07 
13 | 6.023 | 6.023 | 5 99 | 4 003 13| 4-024 | 4 .024 | 3 оов | 2 -002 
12 | 5024 | 5 -024 | 4.00 | 3 -003 12 | 3-019 | 3-019 | 2 -oost| 100! 
B cae | eme | ape) ам 11 | 3:04 | 2-013 | 1.003 | 1003 
10 | 4-09 | 3 20 | 2006 | 1 -oo1 0| 2-028 | 1-007 | 1007 | 0 00 
24549 | 2 ІК dem ; 1 :016 1.016 | 0.002 | 0002 || 
8| 2 035 | 1 010*| 0 00 | 0 -002 9 | 0 00 | — 
à j 0 -004 8| 103 | 0 006 
7| 1:03 1 023 0 -004 = 
7| 0 013 0.013 | — | 
6 | 0и 0 оп | — — n == 
5| 0.026 | — = d 0: 10% | | 
14 | 16 |10 өз | 9 о4 | 8 -oos+| 7 -002 .004 | 5 -004 
15 | 8 -025+| 7 -o10-| 7 -o10-| 6 «ооз 10 | 16 | 76 | бом я кл 3-0: 1 
14 | 7 032 | 6 013 | 5 .005-| 5 .005- 15 | 5-018 | 5 8 3 005-| 3 0057 | 
13 | 6 035+ | 5 14 | 4 -005+| 3 001 14 | 4018 | 4 018 2.03 | 2 00 
12 | 5 -035+| 4.014 | 3 -005-| 3 -o0s- 13 | 4 042 | 3 -014 2 08 1 002 | 
11 | 4.033 | 3.02 | 2.004 | 2 004 12 | 3.02 | 2009 1 005-| 1057 || 
10 | 3 02 | 2-009 | 2-009 | 1 -002 П| 2 021 | 26%и 0 00 | 0 002 
9| 2-021 | 2-021 | 1.006 | 0 от 10 | 2042 | 1 ou O ои | 0 004 |. 
8| 2.045-| 1.013 | 0-0 | 0 -002 9 | 1.023 | 103 po > 
7| 1-030 | 0-006 | 006 | — 8 | 1.045-| 0 -008 0 A 
6| 0-013 | 0 из | — — 7| 0-017 | 0.07 | — i 
5 | О | c = 6| 0.035-| — = 
13 | 16 | 9 озо | 8 on 7 9% | 7 -004 -| 4 002 | 
15 | 8 07 | 7 о | 6. 5 -002 16 | 6 07 | 5 ою-| 5010 3 +003 
14 | боз | боз | 5 оов | 4.05 15 | 5-040 | 4 012 | 3 Ба 2 -002 
13 | 5 23 | 5 оз | дов | 3 -003 14 | 4.034 | 3 -o10-| 3 -o10 1 01 
12 | 4 022 | 4 022 | 3 007 | 2,002 13 | 3 -025+ | 2-007 | 2 -007 1 -003 
11 | 4 ов 3 -018 2 -005+ | 1001 12 | 2 -016 2 016 1 -003 0 9! 
10 | 3.05 | 2.013 1 -003 1 -003 11 | 2 оз 1 -008 1 -008 0 0? 
91202 | 1 оов | 1 008 | O -001 10 | 1.017 1.017 | 0 -002 
8 | 1.08 | 10% 0 003 | 0 -003 9| 1.034 | 0 -o06 | 0006 | — 
7| 1з | 0-007 | 0 ооу | — 8 | 0-012 | 0.4012 | — T 
6| 0-017 | 0 07 | _ — 7| 0-024 | бом |_ E 
5| 007 | — =, = 6| 0 -o4s+| — = ЕЕ 
The table shows: д] 
(1) In bold type, for given A, B and a, 


quoted (single-tail test). 
(2) In small ¢; 
or less than the 


sagd 
the value of b (<a) which is just significant at the probability 
ype, for given A, B and r= 


i'd 
а 
a+b, the exact probability (if there іні b is ed" 
integer shown in bold type. 3 са а а | 


{ 


| 


Significance tests in a 2 х 2 contingency table (continued) 


Probability Probability 
a a 
0:05 0:025 0:01 0:005 0:05 0:025 0:01 0:005 
А=16 B=g | 16| 5-028 | 4 007 | 4-007 | 3:1 |A=16 B=3 |16| 1-018 | 1 018 | 001 | 0 00 
15 | 4 -028 3 +007 3 -007 2 001 15 | 0.004 | O 004 | 0 004 | 0 -004 
14 | 3 ош 3 -021 2 905-| 2 4005- 14 | 0 010*| 0 -0ю% = 
13 | 3 047 2 -013 1 -002 1 -002 13 | 0 021 0 2 | — — 
12 | 2 028 1 -006 1 -006 0 -001 12 | 0 -036 | — E == 
11 | 1-014 | 1-014 | 0 -002 | 0 -002 : 
10 | 1.027 | 0 004 | 0 004 | 0 -004 2 |16 р 007 | 0 007 | 0007 | — 
9 | 0-009 | 059 | 0% | — z 0 =: 0110801 | == = 
8| 0-017 | 0 017 | — — < E m 
7| 023 | — — — 
А=17 В=17 | 17 | 12 02 |12 22 |11 00 | 10 ом 
7 |16| 4-020 | 4.020 | 3 004 | 3 004 16 | 11 -043 |10 020 | 9.008 | 8 -003 
15 | 3.017 3 017 2 -003 2 -003 15 | 9 -029 8 -013 7 :005+ | 6 -002 
14 | 3 :045+ | 2 01 1 -002 1 -002 14 | 8 035+ | 7 016 | 6 -007 5 .002 
13 | 2 -026 1 -005-| 1 -005-} 1 -005~ 13 | 799 | 6 018 | 5 007 | 4 -003 
12 | 1 -012 1.012 | 0 -oo1 | 0 901 12 | 6:02 | 5 09 | 4 -007 | 3 -002 
11| 1.024 104 | 04003 | 0 00 11 | 5.02 | 4 -018 3.00 | 2 -002 
10 | 1.045-| 0 ф7 | 09097 | — 10 | 4.040 | 3-16 | 2 005+} 1 -001 
9| 0-014 | 004 | — — 9| 3 -03s+| 2 -013 | 1003 | 1 -003 
8 | 0.026 | — = == 8| 2.02 | 1:08 | 11008 | 0 001 
7| 0.047 |— - — 7| 1.020 | 1.00 | 0 004 | 0 -004 
6| 1.043 0 -009 0 09 | — 
2 5| 0.022 | 0022 | — ә 
6 “013 2 -002 :002 
2 5 pes 3 009 2 009 1 -001 16| 17 | 12 -044 |11 -018 | 10 -007 9 -003 
14 | 2 :025+ | 1 -004 1 -004 1 -004 16 | 10 -035-| 9 -015-| 8 -006 7 -002 
13 | 101 1-01 0 -001 0 -001 15 | 906 | 8021 7-00 | 6 -003 
12 | 1.023 | 1-023 | 0 -003 | 0 005 14 | 7 .025+| бооп | 5 -004 | 5 -004 
11 1 043 0 -006 0 :006 | — 13 | 6 -027 5 оп 4 -004 4 -004 
10 | 0.02 | 0 0122 | — РЕ 12| 5-027 | 4:011 | 3 -004 | 3 -004 
9 | 0.023 | 0.023 |— Бы 11 | 4 025 | 3 909 | 3 909 | 2 -003 
8| 0 |= = a 10 | 3.022 | 3 о2 | 2907 | 1 -002 
9 | 3.046 | 2 017 1 -004 1 -004 
8 | 206 1 01 0 02 | 0002 
5 | 16] 3-048 | 2 08 | 2 008 | 1 001 7| 1.024 | 124 | 0-05) 0 005- 
15 | 2.028 | 1-004 | 1-004 1 -004 6| бои | 0п |— == 
14 | 1 оп | 10и | 0001 | 0 001 Sij (0026 |= E = 
13 | 1 025+] 0 003 | 0-юз | 0 00 15| 17 | 11 -038 | 10 0157 | 9 006 | 8 002 
12 | 197 | 0 06 | 0 006 | — 16 | 9-027 | 8 -o1 | 7.004 | 7 ом 
11 | O -012 0 012 | — = 15 | 8 035*| 7 -015-| 6 -006 5 -002 
10 | 0 өз | 0023 = 14 | 7-04 | 6 017 | 5 -006 | 4002 
9| 00» |— oa Б 13 | 6041 | 5 өл | 4 906 | 3 0 
12 | 5 4039 4 -016 3 -o0s+| 2 -001 
4 004 11 | 4 935%| 3 -013 2.004 | 2 -004 
bit ges 1 n 0 кең 1 "001 10 | 3 o» 2 -107| 2 -010-| 1 -002 
15 | 1 013 г pos Q0 | 0 00 о | 2-022 | 2-022 | 1-006 | 0 00 
n б ы Q7 | 0 007 |— 8| 2 046 1 014 | 0.002 | 0 -002 
E. == 7| 1 -030 0 -006 0 -006 | — 
12 | 0-04 | 0 014 ы 6| 0-014 | 0 04 | — = 
11 | 0 026 | — = 5 | 0-01 |— “е ең 
10 | 0.043 | — Қаса = 


Significance tests in а 2x 2 contingency table (continued) 


Probability Probability 
a a 
0:05 0-025 0-01 0-005 0-05 0:025 0:01 0:005 
A=17 B=14| 17 | 10 032 | 9-012 | 8:004 | 8 ом |А=17 В=11 | 13 | 4-042 | 3.014 | 2.004 | 2 004 
16 | 8-021 | 8.021 | 7-08 | 6 -003 12 3 031 | 2-009 | 2 009 1 002 
15 | 7.026 | 6 00-| 6 0107| 5 -003 11 2.020 | 2 -020 1 :005- | 1 0057 
14 | 6.028 | 5 боп | 4 4004 | 4 -004 10! 2-040 | 1 0и | 0 -001 0 00! 
13 | 5 ол | 4 010-| 4 -o10-| 3 -003 9| 1.022 1022 | 0 -004 0 004 
12 : :024 4 -024 3 -008 2 -002 | 8, 1 -042 0 -008 0 | — 
ш ome lama lias | ји Мн 13 
" x “ -003 К — => € 
9 | 2029 1 -008 1 -008 0 -001 E | uo 
8| 1.08 | 1 ois . E 
7| 1:08 | 0 -007 о in ды T 10| 17 | 7 04 6 o2 5 3003 aes 
6 бау 05 li = 16 | 6 047 | 5 -o1s+| 4 004 | 4 004 
Б! oms 1 = = 15 | 5-043 | 4-014 | 3 004 | 3004 
14 | 4 ом | 3 ою%| 2-002 | 2 992 
13 | 3.04 | 3 -024 2 -007 1 -001 
13 П 9.06 | 8.09 | 8 -009 | 7 -003 12| 34 | 2 015+ | 1 ооз | 1003 
М р се 7 015+ | 6 005+ | 5 -002 11 | 2-031 | 1-007 | 1-007 | 0 0 
AL = 6 -018 5 006 | 4 -002 10 | 1 -016 1 016 0 -002 0 +002 
212 — : 018 | 4-06 | 3 -002 9 | 1-031 | 0 -05%| 0 -005+ | — 
wll IE A 3 016 | 3 005*| 2 -001 8 бои 091 | — == 
ola = 3 Es ; = :004 71 0 022 | 092 | — -- 
` С ғ 002 . — = 
10 | 2 -o19 2 -019 1 -005– | 1 -005- тр - 
9 | 2 -040 1-01 0 -002 4 
mus | qum | 0 0 = 9 |17| 6.032 | 5 оов | 5-008 | 4 9? 
7| 1.047 | 0 -010-| 0 -o10-| — 16 | 5.034 | 4 -010-| 4 -o10-| 3 90? 
6| 0.021 | 02 |— ТЕ 15 | 4-028 | 3-08 | 3.008 | 2 902 
5 | 48 | m m 14 | 3.00 | 3.020 | 2 005-| 2 905 
13 | 3-042 | 2 012 | 1:02 | 1 -002 
121 17 | 8 02 | А | 12 | 2 05*| 1 -006 1 -006 0 оо! 
| fae | eam || ан | gam A) тш так EET. 
15 | 6 -033 5 -012 4 -004 4 m 4 les ыы ее цен 
14| 5з | 4 оп | 3-003 | 3 m a) Ave mam] p | 
| 13 | 4 026 | 3 оов | 3-0 | 2-002 S Oae 08 | — E 
12, 3.00 3 -020 2 -006 1 -001 T| 000 | — = pt 
11| 354 | 2-013 | 1-003 | 1- | 
10 | 2-028 | 1+ — 8 |17] 5. оо! 
+ у 7 | 1-007 | 0-0 024 | 5.04 | 4:06 | 3 
8$ 1o | | 0 0 | 0 16 | 4-023 | 4 оз | 3 006 | 2 09 
7| 0 002 1 006 | 0-06 | — 15 | 3 017 | 3 017 | 2 004 | 2 e 
pss 012 | — _ 14 | 3-039 | 2 -o10-| 2 -o10-| 1 js 
== T E 13 | 2022 2 -022 1 -004 1 A 
12| 2-043 | 1 o0-| 1 -o10-| 0 ? 
11 4 | 
| 16 | ces | 706 | 6 205°] бу. и | 1-020 | 1-020 | Оз | 09%) 
| 6 022 | 5 097 4 | 10 | 1з | 0-06 006 | — 
15 | 5 22 | 5.95 4 002 | 0 
| doi | 4 4 007 | 3 002 | 9 0-012 | 04012 | — = 
‘019 | 3.0% | 2 0 8| 0:22 | 002 | — = 
The table shows: | ý | SES ps d 
(1) In bold type, for 


quoted (single-tail ipm A, B and a, the value of b 


(2) In small type, for gi 
ype, given A, B = 
or less than the integer shown in riim 
e 


--—«»—masmümsma ci  __ 


< TP ! 

и" (<a) which is just significant at the probability lev? 

“+, the exact probability (if there is independ quel "Á 
С ndence) that b is € 


Significance tests in a 2 x 2 contingency table (continued) 


Probability Probability 
a ] a 
0-05 0:025 0:01 0:005 0:05 0:025 0:01 0:005 
А=17 В=7 |17| 4017 | 4-017 | 3:00 | 303 |А-18 B—18| 18 |13 023 | 13 023 | 12 o10-| 11 ом 
16 | 3-014 | 3 014 | 2 00 | 200 17 | 12 044 | 11 020 | 10 009 | 9 -004 
15 | 3038 2.09 | 2-009 | 1 00! 16 | 10 0: | 9 014 | 8.006 | 7 -002 
14 | 201 2 021 1 004 1 -004 15 | 9 -038 8 018 | 7 -008 | 6 -003 
13 | 2 -042 1 009 | 1 -009 0 -001 14 | 8 05 7 -020 6 -009 5 -003 
12 | 1-018 | 1-018 | 0 002 | 0 002 13 | 7.046 | 6 022 | 5 99 | 4 -003 
11 | 1 -034 0 -005-| 0 -005- 0 -0057 12 | 6 -047 5 022 | 4 -009 3 -003 
10 | 0 -107| 0 0107| 0 0107 — 11 | 5 -046 | 4 -020 3 -008 | 2 -002 
9| 0.09 0.019 | — — 10 | 4 -043 3 018 2 -006 1 -001 
8 | 0.033 |= = = 9 | 3 -038 2 -014 1 -004 1 -004 
| 8| 2.030 1 -009 1 -009 0 -001 
6 |17| 301 | Зои | 2-002 | 2002 7: 1-020 | 1-0 | 0-04 | 00 
16 | 3-040 | 2 ов | 2 008 | 1-001 6| 1-044 | 0 о10- 0 -o10-| — 
15 | 2м | 2-021 | 1-003 | 1 -003 5 | 0-023 | 0 өз | — - 
14 | 2 045*| 1 -009 1 009 0 -001 
13 | 1.08 | 1408 | 0 002 0 002 17 | 18 | 13 045*| 12 -o19 |11 +008 | 10 -003 
12 | 1.035-| 0 -oos-| 0 %05- 0 -005~ 17 |11 06 10 016 | 9 00 | 8 00 
11 | 0:09 | 009 | 099 |— 16 |10 99 | 9 023 | 8 01-| 7 00 
10 | 0 «017 0 017 |— = 15 | 8 -028 7 012 6 :005– | 6 -005- 
9| 009 | — a as 14 | 7-030 | бооз | 5 405%| 4 -002 
8 | 0 -oso-| — = € 13 | 6 03 | 5-013 | 4 405-| 4 -0057 
12 | 5 0» 4 -012 3 -004 3 004 
5 | 17] 3.043 | 2-006 | 2-006 | 100! I| 45028 рз коок 2:285. | notos 
16 | 2.024 | 2024 | 1-003 | 1-03 10 | 3-023 | 3 023 | 29% | 1-00 
15 | 1-009 | 1-009 | 1 00 0 001 9| 3an | 2486 | dons] 00s 
14| 1 | 1.021 | © 02 | 0 002 далары өш 
13 | 1-039 | 0 -oos~| 0 057 0 005 i : ed 1 p 0 -005— | 0 -005- 
12 | 0 -o10-| 0 -010-| 0 0107 — 5| 0 ins — в = = 
11 | 0.018 0-08 | — = 
10: | 10-080 || == am "E 16| 18 | 12 0:9 | 11 0:16 | 10-006 | 9 002 
9| 0-00 |— = = 17 | 10 029 | 912 | 8 -005-| 8 -005- 
16 | 9.038 | 8-017 | 7-07 | 6002 
4 | 17] 2-029 | 1-003 | 1-003 | 1 -003 15 | 8-043 | 7 019 | 6 -008 | 5 -003 
16 | 10и | 10и | 0% | 000 14 | 7-046 | 6.020 | 5-008 | 4 -003 
15 | 1.02 | 0.03 | 0 003 | 0 -003 13 | 6 -04s+| 5 -020 | 4-007 | 3 902 
14 | 0.006 | 0 -006 | 0 006 | — 12 | 5-042 | 4-018 | 3 -006 | 2002 
13 | 0 012 | 0-02 | — a 11 | 4 -037 3 -o1s-| 2 -004 | 2004 
12 | 0 02 0 021 | — әк 10 | 3.031 2 01 1 -003 1 -003 
11 | 0 -035+ | — — E 9 | 2 > 2 -023 1 -006 | 0 -001 
| ,8 | 2 -046 1 -014 0 -002 0 -002 
з | 17 1 1-016 | 1-016 | 0-0 | 0 001 "7 | 1.030 | 0-06 | 0 +006 | — 
16 | 1-046 | 0.004 | 094 | 0004 6| 0-014 | 0 014 | — = 
15 0 -009 0 -009 0 009 — 5 | 0-031 | — = = 
14 | 0-018 | 0:08 |— E 
13 | 0-031 | — = — 15118 11933 | 10 013 | 9 -05-| 9 -005~ 
12 | 0 -049 = = 17 | 9.023 | 9 023 8 -009 | 7 -003 
16 | 8-029 | 7-012 | 6 004 | 6 004 
2 Р ме | 05% | — 15 | 7-031 | 6 013 | 5 -0057| 5 '005- 
~ ? E н pe жм = 14 | 6 o1 | 5-013 | 4 -004 4 004 
15. | oo |— = — 13 | 5-029 | 401 3 -004 3 -004 
"жаы, 


Significance tests in a 2x 2 contingency table (continued) 


Probability Tw 
Probability 
a 
a ' 4 0.01 | 0:005 
0-05 0:025 0-01 0:005 0:05 0-025 | 
E - ox | 100+ 0 001 | 0 00! 
ot ТЕ-ДЕ НІНЕ |“ ов 5 1 021 1 -021 | 0 -003 | 0 00 | 
11| 3 -020 3 -020 2 -006 1 -001 8 | 1 -040 0-07 0907 | — 
10 | 3 041 | 2-014 | 1.004 | 1 -004 2| бё | G6 = — 
9| 2 030 | 008 ^ 2 4 = al owi |— B | =: | 
8| 1 018 018 i E 
038 | 0 07 | 0 007 | — р . 004 | 6 00 | 
кө А а | m рлеу До ҒЫН Т 
5| 0-036 | — — — i E pue 5 Е 4 005+ | 3 be 
ч = у 15 | 5.043 | 4 -o1s-| 3004 | 3 bs 
14| 18 | 10 928 | 9 -o10-| 9 0107| 8 -003 4.03 | 301 | 2-003 | 2 
17 | 9 043 | 8.017 | 7 -006 | 6 -002 14 ees 54H 2 007 1 00 
16 | 8.050 7 021 | 6 008 | 5 -003 13 302 5 1-003 | 1:00 
Р 2| 3 046 | 2 014 оо! 
15 | 6 02 | 6 022 | 5 008 | 4 -003 | coo | oor | Lao | (050 
14 | 699 | 5 20 | 4007 | 3002 11 -| 1.015-| 0.02 | 0 00 | 
13 | 5 044 | 4 017 | 3.006 | 2.001 10 pon o ams] 0 09-1 045 
12 | 4037 | $05 | 2-004 | 2 00 4 B enel aes = 
H с «8 25 ded Ede 7| 02 0 20 | — m 
10 | 2.020 2 -020 1 -005-! 1 -005- іы |2 E Y 
9| 2909 1 оп 002 | 0 -002 
8| 1.024 | 1-024 | 0 -004 | 0 -004 "rm 6 ow*| 5:0 5 T 
7| 1.047 0 -009 0 о | — ^ 4 00 4 0 
6| 0.020 | 0020 | — = 17 | бои | 5 «із 3 ооз | 3 99 
5 | 0.043 | — = = 16 | 5 96 | 4 оп 2, :002 
15 | 4-028 | 3 008 | 30% 2,0057 
13| 18 | 9.03 | 9.023 | 8-08 | 7 -002 | 14 | 309 | 3 о : bus 1 00 
17 | 8 ом 7 012 | 6 -004 6 -004 13 | 3 -039 2 01 1 005 | 0 00 
16 | 7.037 6 14 5 :005- | 5 -005- 12 | 2 -023 2 +023 б ой 0 00! 
15 | 6 -036 5 14 4 004 | 4 004 11 | 2 -043 1-01 ma 0 003 
14 | 5 о2 | 4 012 3 -004 3 -004 10 | 1 -022 1 -022 ea ен 
13 | 4 ол 3 -009 3 -009 2 -002 9 | 1 -040 0 -007 EJ 
12 | 3.00 | 3.020 | 2-006 | 1 001 8| 0.014 | 0 014 | — ж 
11 | 300 | 2 013 1 -003 1 -003 7| 0.027 | — = E 
10| 2-027 | 1-007 | 1.07 0 -001 6| 009 | — == 
9 | 1 -015+| 1 o15+| 0 -002 0 002 4 002 
8| 1-031 | 0 006 | 0-06 | — 9 |18) 6.029 | 5.007 | 5 907 3 002 
7| 002 | 0.02 |... = 17 | 5 озо | 4-008 | 4 -008 200 
6| 0 025+| — E. = 16 | 4 оз | 4 оз 5 bd 2 004 
12| 18 | 8 ов 214. us 15 | 3 16 3 aig 2 am 1 002 
:006 6 -002 14 | 3 -034 2, -009 1 004 
17| 746 | 6 ов | 64 | 5.95 13 | 2-019 | 2ш» | 1008 | 1% 
16 | 6 027 | 5.0% 5 99 | 4 .003 12 | 2-037 1 -009 1 -009 0 ‘90? 
15 | 5 ом 5 .024 4 -008 3. :002. m 1448 | 1 018 0 -002 
14 | 4 020 | 4 020 | 3 со 2 -0o01 10 | 1-033 | 0 005+) 0 005*|— 
13 | 4 042 | 304 2 а | 204 9| 0 010+! aH = = 
12. | 3 өш | 2% 2-009 | 1 о» $: «ox. 04% lis = 
ШІ 29 | 2ш» | 105] 1 "005- TI 10-%6 | — "= 14 
The table shows: 


(1) In bold type, for 
quoted (single-tail test). 
(2) Та small type, 
or less than the inte 


given A, B and a, the value of 5 (<a) which 


for given A, B and r= 
ger shown in bold type. 


a 


ГІ 


лику 10 
is just significant at the probabilitY 


1 
" ue 
+6, the exact Probability (if there is independence) that b i$ od 


Significance tests in a 2x 2 contingency table (continued) 


Probability Probability 
| a - 
| 005 | 0025 0-01 0-005 0-05 | 0025 001 | 0.005 
А=18 B=8 | 18 | 502 | 5 022 | 4 -005-| 4 05 |А=18 B=4 | 13 | 0 or | 007 | — = 
17 | 4-020 | 4 020 | 3004 | 3 -004 12 | 028 |= == а 
16 | 3.014 | 3-014 | 2.003 | 2 903 11 | 0 -o45+ | — и 2 
15 | 3 -032 2 -008 2 008 1 -001 
14 | 2 017 2 017 1 -003 1 -003 3 | 18| 1-014 1-014 0 00 0 -001 
13 | 2-034 | 100 | 1-07 | 0 0! 17 | 1-041 | 0-003 0-003 | 0 -003 
12 | 1.015+| 1 ө5%| 0 002 | 0 -002 16 | 0 -оов | 0-008 | 0:00 | — 
11 1 -028 0 -004 0 -004 0 -004 15 | 0-о05%) 0015 — = 
10 | 1-049 | 0 00 | 0 98 | — 14 | 0.06 | — == = 
9} 0-016 | 006 | — — 13 | 0 042 | — |— = 
02 a жана 
| - о 5s = > — 2 18 0 -005+ | 0 -005+ 0 -005+ | — 
17 | 0 -016 0 016 | — = 
7 | 18| 4 015*| 4 -015+ | 3 -003 | 3 -003 16 | 092 | — |— - 
17 | 3 012 3 012 2 -002 2 -002 
16 | 3 92 2 -007 2 -007 1-001 |A.19 B=19| 19 | 14 023 | 14 -023 | 13 -o10- " 
15 | 2 017 2 017 1 -003 1 -003 18 | 13 -045- | 12 21. |11 -009 12 bod 
14 | 2 034 1 :007 107 | 0 mi 17 |11 -031 |10 -015- 9 -006 8 -003 
13 | 1 014 1 014 0 -002 0 -002 16 | 10 -039 9-09 2:05 7 xà 
12 | 1:07 0 -004 0 -004 | 0 -004 15 | 9-046 | 8 022 | 6 004 6 -004 
11 | 1 046 | 0:07 | 0-07 |—. 14 | 8 -0s0-| 7904 | 5.00 | 5 004 
10 | 0 -013 0 43 |— = 13 | 6 95*| 5 01 4 ооз | 4 -004 
9 |10704 | 0404 == = 12 | 5-024 | 5-024 | 3003 | 3 -003 
8| 0.040 | — EE == 11 | 599-| 4 02 | 3 00 2 -003 
046 019 f || 4 
6 | 18 | 3 010-| 3 0107| 3 0107| 2 00 т 4 1039 5 01s" : = A S 
17 | 3 -035+| 2 -006 2 -006 1 -001 8| 2 01 Loads: | 14% бй 
16 | 2-018 | 2 18 | 1-003 | 1003 7 | 1021 | 1-021 | 0 +004 | 0 -004 
15 | 2-038 | 1-007 | 1 007 | 0 001 6 | 1 -045-| 0 -010-) 0 -010- — 
14 | 1 015-1 1-и5-| 0002 | 0 '002 5 | 0.023 | 0.023 | — = 
13 | 1028 0 -003 0 -003 0 003 
12 | 1 048 0 -007 0 07 | — 18| 19 | 14 4046 |13 920 12 008 | 11 -003 
| 11 | 0-013 | 003 |— == 18 | 12 0» |11 017 |10 00 | 9 -003 
10 | 0.022 | 002 | — = 17 |1094 |10 024 | 8 004 | 8 -004 
9 | 0.037 | — -- == 16 | 9.030 | 804 | 7 006 | 6 902 
15 | 8 -033 7 015+ 6 -006 5 .002 
5 |18 | 3.04 | 2-06 | 2 006 1 -001 14 | 7 055*| 6 016 | 5 :006 | 4002 
17 | 22 | 2 021 | 1-003 | 1 -003 13 | 6 05-| 5 015+ 4 -006 | 3 -002 
16 | 2-048 | 1 оов | 108 | 0 0 12 | 5033 | 4-014 | 3 -05-| 3 005- 
15 | 1-017 | 1-017 | 0 02 | 0 002 11 | 4-030 | 3 -o1 | 2.004 | 2 -004 
| 14 | 1.033 | 0004 | 0 -004 0 -004 10 | 3.025-| 3 925-! 2 008 | 1 -002 
| 13 | 0.07 | 0.07 | 097 | — 9| 3-049 | 2-019 | 1 005*| 0 001 
12 | 0-014 | 0 014 | — = 8 | 2 -038 1 012 | 0 002 | 0 00 
11 | 0.024 | 0 024 | — = 7| 1.025+| 0 0057| 0 -005-| 0 005- 
10 | 0:08 | — = ue 6| 0-012 | 0012 | — к= 
5| 0:27 | — = = 
4 003 1 -003 1 -003 
* 1 1 ped : 010-| 1 -010-| 0 901 17 | 19 13 040 | 12 -016 | 11 -006 | 10 -002 
16 | 1-024 | 1-024 | 0 002 | 0 -002 18 |11 -030 |10 o1 | 9 :005%| 8 -002 
15 | 1.046 | 0 -0057| 0 0057 0 -0057 17 | 10 040 | 9-08 8 008 | 7 003 
14 | 0 -o10-| 0 0107| 0 0107 — 16 | 9-047 | 8 022 | 7 -009 | 6 -003 
| 
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Significance tests in a 2 x 2 contingency table (continued) 


— «Д | 
| Probability Probability | 
l 
| а | | а | 
0:05 0:025 | 001 | 0-005 0:05 0:025 0:01 0-005 | 
A=19 В=17 | 15 | 8 -050-| 7 023 6 -010- 5 94 | А=19 В-14 16, 7 -042 6 -017 5 -006 4 9021. 
14 | 6 023 | 6.023 | 5 010- | 4 -003 15 | 699 5 015+ 4 -oost 300! 
13 | 6 049 | 5 022 | 4-008 | 3 -003 14 | 5 о4 4 013 3 -004 3 004 
12 | 5 :045- 4 -019 3.007 | 2 -002 13 | 4 07 3 -009 3 -009 2 003 
11 | 499 3 -015+| 2 .005- | 2 -005- 12 | 390 | 3-020 | 2-006 | 100! 
10 | 3.02 2 01 | 1-00 | 1 -003 И | 3 -o40 | 2-013 | 1:03 | 100$ 
9 | 2 024 | 2.024 1 -007 0 соо! 10 | 2 -027 1 -007 1 0077 000! | 
8| 2.047 | 1 -015-| 0 -002 0 -002 9 | 1.015-| 1.015-| 0 :002 | 0 00 |. 
Vi 5 31 | 096 | 0-06 | — 8 | 1:00 | 0 -oos+| 0 -oost | — 
6 0914 | Q 014 | — — 7| 0-012 | 0 012 | — = 
5| 0-031 |— = = 6| 0.024 0 024 |= = 
16| 19 | 12 9%- 11 оиз | 10 %5- | 10 -005– Sj ОНЯ j= = T 
18 | 10 -024 10 024 | 9 -o10-| 8 -004 13| 19 | 900 | 9.020 | 8 -006 | 7 002 
17 | 9-031 8-013 | 7 -05%| 6 002 18 | 8-029 | 7 -o10+| 6 -003 | 6 03 
16 | 8 -035-| 7 015*| 6 -006 | 5 -002 17 | 7 оз 6 оп 5.004 | 5 004 
15 | 7.06 | 6 015*| 5.006 | 4 -002 16 | 6.029 | 5 оп | 4.003 | 4003 
14 | 6 034 | $ 014 4 -005+ | 3 -002 15 | 5 025+] 4 -009 4 “009 3 -003 | 
13 | 5 01 | 4-013 | 3.004 | 3 004. 14 | 4-020 | 4.020 | 3 006 | 2 '002 
12 | 4-027 | 3 00-| 3 -o10-} 2 -003 13 | 4 4 3 -015-| 2.004 2 004 
11 | 3-021 | 3:21 | 2-007 | 1 -002 12} 3-029 | 2-009 | 2-009 | 1002 
10 | 3042 2 015-| 1-00 | 1 -004 11 | 2-019 | 2-019 | 1.005-| 1 0057 
9 | 2-030 | 1:09 | 1-009 | 0 о 10 | 2-036 | 1 -o10-| 1 -o10-| 0 00! | 
8| 18 1-018 | 0-003 | 0 -003 9| 1.020 1 -020 0 -003 0 003 |. 
1 : EG | : У 0 007 | — 8| 108 | 0-07 | 0407 | — 
б Wan == = 7 | 0 -015-| 0 015- | — = 
үй йк = = = 6| 00 |— = = 
15| 19 | 11-029 10 on 9 0044 | 9 -004 12 | 19 | 9 -049 - 0057 
4 8.016 | 7 :005-! 7 
18 | 10 046 | 9 то | 8 007 | 7 -002 18 | 7-022 | 7.022 | 6:007 | 5 902 
Т 8 023 | 8.03 | 7-0 | 6 003 17 | 6 022 | 6 022 | 5 өт | 4 902 
4 - = ba Ы | 
У па | 4 :025- | 6 0107| 5 -003 16 | 509 | 5.09 | 4-006 | 3 E 
E- | 6 024 | 5909 | 4 -003 15 | 5.042 | 4 .015+| 3-04 | 3 9? 
nls 02 | 5 022 | 4 -008 | 3 002 14 | 4-032 | 3 ои | 2-003 | 2 9? 
“045+ 4 
Ба ЈЕ 4-18 | 3006 | 2 -002 13 | 3023 | 3.023 | 2 006 | 1 00! 
| 3-014 | 2 004 2 -004 12 ч | | 1 003 
11 | 3-02 | 2 3-43 | 2-014 | 1 00 
10 | 2. 99 | 209 | 1 -002 11 | 2-027 | 1 007 | 1007 | 0 0?! 
020 2 ог -005+ 
0 1 :005+ | 0 -001 nom у 0 002 
9| 2» | 4 on и 10 | 2 -050 1 -014 0 -002 5^ 
8| 1з | 103 | 9 a 0 -002 9| 1-027 | 0-05-| 0 -oos-| 0 9? 
7| 1-046 | 0 о | 0 ооо 0 -004 8 | 1.050-| 0 -o10-| 0 -o10- | — 
6 | 0 020 | 0 -o2 | E 7| 0-019 | 0 из | — == 
3| 0:02 |= = FR 6| 0:05 | — == == 
14 19 |10- Е 1 i aa | 6 6% 
ЈЕ ДИ 10 од | 9 оов | 8 ооз | d 8 9и 7 02 | 6 +003 б 
97 | 804 | 7 -005~| 7 фу 8 7-047 | 6016 | 5 004 | di 
W| 82 | 7-017 | бб | 5.5; 17| 6 08 | 5.05-| 4-004 | 4 p, 
| | 16 | 5-55 4-012 | 3-003 | 3% 
The table shows: | 


(1) In bold type, for given A, B 
quoted (single-tail test). 

(2) In small type, for given A, B 
or less than the integer shown a b 


and a, the val " e 

ue of b (<a) which is just significant at the probability је“ 
and r=a +b 
old type. 


[| 
| 


> th dE o 
Шы. Probability (if there is independence) that 6 is equal у 


Significance tests in a 2х 2 contingency table (continued) 


Probability Probability 
| & a 
| 0:05 0:025 0:01 0-005 0:05 0:025 0:01 0-005 
A=19 B=11| 15 4-027 | 3-008 | 3-08 | 2 002 | А=19 B=7 | 19 | 4 013 | 4-013 | 3-002 | 3002 
14 | 3 18 | 3.018 | 2-05-! 2 -005- 18 | 4.047 | 3 010*| 2 -002 | 2 -002 
13 | 3 035+ | 2 010*| 1 -002 1 -002 17 | 3 -028 2 006 | 2 -006 1 :001 
12 | 2 02 2 021 1 05-| 1 -005- 16 | 2 014 | 2 -014 1 -002 1 -002 
11 2 -040 1 010*| 0 -001 0 -001 15 | 2-08 1 :005+ | 1 -oos+| 0 -001 
10 | 1 -020 1 -020 | 0 -003 0 -003 14 | 1-01 1 01 0 -001 0 соо! 
9 | 1.037 | 0-06 | 0-06 | — 13 | 1.021 | 1-01 | 0 -003 | 0 -003 
8 | 0 015 0 013 | — — 12 | 1.037 0 -005%| 0 -005+ | — 
7 | 0 -025-| 0 025-| — — 11 | 0 :010- | 0 0107| 0 0107| — 
6| 0046 | — — — 10 | 0-017 | 0 0177 | — == 
9| 0-030 |— = = 
10 | 19 | 7.03 | 6.009 | 6 00 | 5002 8| Оов |— = = 
18 | 6 -036 5 011 4 -003 4 -003 
17 | 5 -030 4 -009 4 -009 3 -002 6 | 19 | 4.050-| 3 -009 3 009 | 2 001 
16 | 4 -022 4 022 3 06 | 2 -001 18 | 3 01 2 -005+ | 2 005*| 1 001 
15 | 4 -047 3 -0157| 2 -004 2 -004 17 | 2.015+| 2 -015+| 1 -002 1 -002 
14 | 3.0: | 2-0 | 2 008 | 1 -002 16 | 2.032 | 1 006 | 1-06 | 0000 
13 | 2 017 2 017 1004 1 -004 15 | 1 012 1 -012 0 -001 0 -001 
12| 2 -033 1 -008 1 -008 0 -001 14 | 1 023 1 -023 0 -003 0 -003 
11 | 1-016 | 1-016 | 0 +002 | 0 +002 13 | 1-059 | 0 :005+| 0 -oos+ | — 
10 | 1-029 | 0 -oos-| 0 <005-| 0 “005- 12 | 0 010-| 0 -0107| 0 90-|- 
9| 0-009 | 0599 | O 009 | — 11 | 0:07 | 0 017 | — == 
8| 008 | 008 | — — 10 | 0 028 | — = = 
7| 0:22 | — — == 9| 0 045+ | — = == 
9 | 19| 6-6 | 5 006 | 50% | 4 901 5 | 19 | 3-036 | 2-005-| 2-05-| 2-005- 
18 | 5 -026 4 -007 4 -007 3 -001 18 | 2 018 2 -018 1 -002 1 -002 
17 | 4 -020 4 -020 3 .005-| 3 -0057 17 | 2 042 1 -006 1 -006 0 -000 
16 | 4 -044 3 -013 2 -003 2 -003 16 | 1 -014 1 014 0 -001 0 -001 
15 | 3 -028 2 007 | 2007 1 01 15 | 1.028 | 0-003 | 0 -003 | O -003 
14 | 2.015-| 2 :015- | 1.003 | 1 -003 14 | 1047 | 0 006 | 0-06 | — 
13 | 2-029 | 1.006 | 1:06 | 0 00 13 | 0 оп | 0 0и | — = 
12 | 1-013 | 1-013 | 0002 | 0 002 12 | 0 01 | 0 019 | — m 
11| 1.024 | 1.024 | 0 04 | 0 09 Ок = = à 
10 | 1.042 | 0407 | 007 |— 10 | 0-05 |— = = 
i 0 к 4 = t = 4 | 19 | 2-024 | 2-024 | 1 002 | 1 -002 
71 0.043 | — Ж. ж 18 | 1 -009 1 -009 1 -009 0 -001 
17 | 1.021 1 02 0 002 | 0002 
8 | 19] 5.09 | 599 | 4 00 | 4004 16 | 1-040 | 0.004 | 0 004 | 0 004 
18 | 4 017 | 4-017 | 3 004 | 3004 15 |) “Osoog, МОЈС || оов 
17 | 4.044 3 оп 2 -002 | 2 902 14 | 004 | 094 | — — 
16| 3-027 | 2-006 | 2 006 | 1 0! 13 | (050% | 04 | — == 
15 | 2-013 2 -013 1 -002 1 -002 12 | 0:07 |— — = 
i F -006 0 -001 
Із 3 pes r ba 0 001 | 0 001 з | 19| 1-013 | 1-013 | 0 001 | 0 90 
12 | 1.021 1 ot 0 :003 0 -003 18 | 1 -038 0 003 | 0 -003 | 0 003 
11 | 1-038 | 0-6 | 006 | — 17 | 05060) 10:66 | Ози а 
iO) бон бй; (== Es 16 | 0 013 0 013 | — — 
9 | 0-02 0 00 | — = 15 | 0 -023 0 23 | — — 
8| 0.034 | — - = 14 || а = бр 
6-2 


Significance tests in a 2 x 2 contingency table (continued) 


дыш m 
Probability | Probability 
K — = 
€: | и 0-005 
005 | 0025 | 0-01 0-005 | 0-05 0-025 0-01 
| | | | == — 38 
, | 5 | 5 -004 
А=19 В-2 | 19 | 0 -005-| 0 -05- 0 -005- 0 .005- | А=20 B-18| 15 | 7 w 6 012 5 in des 
18| 0 014 0 014 = — М б i 2 Ea 4 | 3.00 
1 0 029 | — |-- = 024 | 02. per 
16 0-98 | — | = = 12 | 5о 4-020 3-7 2 es 
1 | 4 ош 3 016 2 -oos+ 1 ox 
E 10| 3 өз | 2 012 1 ооз Г E- 
02. 4 007. 
А-20 В=20| 20 | 15 о4 |15 -024 |13 004 | 13 -004 9 2 024 ? E ium] 9-98 
19 |14 46 |13 022 12 -010- | 11 -004 8 048 end юш = 
18 | 12 032 | 11 015*| 10 007 | 9 -003 А £e Gora, bos = 
17 | 11 -041 | 10 020 | 9 009 | 8 -004 6 | 04 = RE 
16 |10 48 | 9 024 | 7 -oos-| 7 005- 5| 0-031 | — 
15 | 8 -027 7 012 6 005+ | 5 -002 | E. 
14 | 709 | 6 013 | 5 00*| 4 -002 17| 20 | 13 0% |1204 | 11 :005* 10 pe 
13 | 6.08 | 5.012 | 4 -00s-| 4 -005– 19 | 11 26 |10 оп 9 -004 9 RT 
12 | 502 | 4 1 3.004 | 3 004 18 | 10 ом | 9 -o1s-| 8 0066 | 7 Ee 
11 | 404 | 4.024 | 3 00 | 200 17| 9-038 | 8:007 | 707 | 6 Er 
10 | 4-048 | 3.00 | 2 00 | 1 -002 16 | 8-040 | 7-018 | 6.00 | 5 Mr 
9| 304 | 2015+) 1-04 | 1 004 15 | 70:9 | 6 017 | 5 007 | 4 P 
8| 2.02 1 010- 1 -o10- 0 -002 14 | 6 0x | 5 016 | 4006 | 3 n 
7| 1.022 | 1-022 | 0-004 | © -004 13 | 5-033 | 4-013 | 3 -005-| 3 pe 
6| 196 | 0 o10* | — es 12| 4-028 | 3010*| 2-003 | 2 ken 
5 | 0.024 | 0 04 | — = Р 11 | 3 022 | 3-022 | 2007 | 1 A 
10 | 3 042 | 205%) 1 -004 t je 
19| 20 | 15 047 |14 020 13 оғ | 12 -003 9|201 | 10 | 1 dos | 0-00 
19 | 13 439 | 12 ois | 11-008 10 -003 8| 1 9 1 :019 [E el p 
18 |11 026 |10 12 | 9 -o0s-| 9 .oos- 7| 1.037 | 0 008 0 о m 
17 |10 -032 | 9 015- 8.006 | 7 -002 6| 0 017 00177 |— 42 
16 | 9 036 | 8-017 | 7-007 | 6 003 5| 0-036 | — — 
15 | 8 -038 7 -018 6 -008 5 -003 | E. 
14 | 7 039 | 6-018 | 5.07 | 4 003 16| 20 |12 o1 |11 o2 10504 10 p 
13| 6.038 | 5 017 | 4 -007 | 300 19 | 11 04 | 10 21 9808 Ri 
12| 5 035+) 4 O15 | 3 4056 | 2 00 18 | 9.96 | gon | 7 004 Г к 
ШІ 498 | 3шз | 2 ом | 2 ом 17 | 8-028 | 7.402 | 6004 5 0% 
10 | 3 о | 2 ооо | 2 009 | 1 00 16 | 7028 | 6 012 | 5 004 4 
.00 
9| 2-019 | 209 | 1 005+ | 0 оо 15 | 688 5 оп 4-004 | 4 00? 
8| 2з | 1.012 0.00 | 0 -002 iá | 25: 099 5 оз 4 -009 3 M 
7| 106 | 0-і O 005+ | _ 13| 5 046 | 4.019 | 3-007 | 2 7, 
£ = ei i= = 12 | 4 ов | 3 014 | 2-004 | 2 E 
T ni Ex И | 329 | 2 o10-| 2 0107 n 00! 
| 10 | 2 -020 2 -020 1 -005+ | 002 
18| 2) и 01 113-017 |12 007 |11 оз 9| 2-039 | 1и | 0 00 | | ой 
| 18 |11 ы» |11 он | 10 -006 | 9 оо 8 1.023 10023 0 004 
17 |10 020 9 ов | 8 ооз 7| 14+ 009 | 009 — 
10 050- 9 0% 7 +004 | р | | 
16 | 8:96 7 7 өш 6| 0:20 0909 — 
| 98 6 005- 6 -005- 5| 0-04 |— din = 
The table shows: | 
(1) Та bold type, for 


quoted (single-tail test). 
(2) In small type, for 
or less than the integer 


given A, B an 


given A, B ң ЕТ? | 
and a, the value of p (<a) which is just significant at the probability 


dr=a+b, th ч ee е 
shown in bold type. T^, Ша exact probability (if there is independence 


4 


e 


M 

J 
ҮШ 
) that b is 99" 


| 
Е 
| 


Significance tests in a 2 x 2 contingency table (continued) 


Probability | Probability 
| 
B | a 
0:025 | 0-01 0-005 005 | 0025 | 0-01 
10.00 10-009 | 9.03 | А=20 В=12 18 | 6 ois | 6 oi$ | 5 0006 | 4. 
| i | 9-016 8 006 7 902 17 | 6.043 | 5-016 | 4-0-| 4. 
906 8 09 7-07 6 0» 16 | 5-034 | 4-012 | 3 003 | 3: 
8. 7-020 6 0088 5 -002 15 | 4 -025+) 3-08 | 3 008 | 2: 
7 6 19 5.007 4002 14 | 4 -049 307 | 2 .005-| 2. 
6: 5 017 4 -006 3 -002 13 | 3 өз 2 -o10-| 2 0107 | 1. 
5: 4 оз 3.004 | 3 -004 12 | 2-020 | 2-020 | 1 -oos-| 1: 
4. 3 ою- 3 010-| 2 -003 11 | 2.0356 | 1:09 | 1-009 | 0. 
3 + 3 02 2 -006 1 -001 10 | 1 -o1s 1-48 | 0-003 | 0 
3: 2 013 1 -003 1 003 9| 1-034 0-06 | 0 06 | — 
2 | 1.07 1.007 | 0 90 s| 04012 | 0 12 | — - 
2 1 .015-| 0002 | 0 0» 7| 095 | 003 |— = 
із 0 .005* 0 005+ | — 6| 0.043 | — == = 
0: 002 | — = 
0. 0 -024 | — = 11| 20 | 8.07 | 7 010*| 6 00 | 6. 
0: — — — 19 | 7.042 | бооз | 5 004 | 5- 
18 | 6 -037 5 -012 4 -003 4: 
10: 10 022 9 007 | 8 00 17 | 599 | 4-009 | 4-009 | 3+ 
9: 8.012 | 7.004 | 7 -004 16 | 4-021 | 4 өш 3.06 | 2. 
8: 7 04 | 6 -0057| 6 0057 15 | 4-042 | 3-014 | 2:03 |2. 
7: 6 013 | 5 005-| 5 10057 14 | 3.028 | 2:08 | 2 00 | 1- 
6% 5 012 | 4 004 | 4004 13 | 296 | 2-016 | 1-003 | 1+ 
5. 4 -009 4 -009 3 -003 12 | 2 029 1 -007 1 007 | 0. 
4: 4 020 3 -007 2 -002 11 104 1 014 0 -002 0. 
4 4 3 -0157| 2 -004 2 004 10 | 1 06 0 -004 0 -004 0: 
S 2.00 | 2 00 1 -002 9 | 1.046 | 0-008 | 0008 | — 
2: 2 018 1 05-| 1 -0057 810-016 | 096 | — — 
2. 1 010-| 1 :010- | 0 901 7| 0-02 |— — = 
1. 1 019 0 003 0 -003 
1: 0 “007 0 97 | — 10| 20 7-030 | 6 908 | 6 “008 5. 
0. 0 014 | — = 19 | 6.031 | 5 99 | 5.00 | 4. 
0: == = = 18 | 5-026 | 4-007 | 4-007 | 3- 
17 | 4 018 | 4 018 3 .005- | 3: 
à j -oost | 7 -002 16 | 499 | 3-012 | 2-003 | 2. 
~ j 3 a 008 | 6 -003 15 | 3.024 | 3 024 | 2-006 | 1: 
7: 699 | 6 09 | 5 903 14 | 3 -o4s+| 2-013 | 1-003 | 1^ 
6: 6 024 | 5 908 | 4002 13 | 2 025+] 1-006 | 1-006 0: 
5. 5 .020 4 -007 3 -002 12 | 2 05-| 1-01! 0 -001 0 
54 4 015*| 3 0057| 3 005 11 | 1 021 1 021 0 -003 0. 
4: 3 оп | 2.003 | 2003 10 | 1-037 | 0-06 | 0:06 | — 
Зы 3 022 | 2 006 1 :001 9 | 0-012 | 0 012 | — == 
3. 2 013 1 -003 1 -003 8 | 0.022 | 0:022 | — - 
PE 1 -007 1 -007 0 -001 7 | 0 038 | — = = 
4 Е -002 | 0 -002 
4 4 | à но | 004 0 -004 9 |20 | 6 023 6 -023 5 005%) 4: 
ы === 19 | 5:22 | 5:22 | 4 005*| 3: 
T 4 0 -009 0 -009 
Ox 0 ов | — = 18 | 4 -016 4 “016 3 -004 З ч 
= = 17 | 4 07 3 -010+ 2 002 2: 
95 Г | | 16 | 3 22 | 3 022 | 2 005+ 1 . 
4 -004 | 7 -004 | 15 | 3 оз 2 012 1 -002 . 
2 . - p^ | р 006 | 5 -002 | 14 | 2з | 2023 | 1 007] 1 


Significance tests in a 2x 2 contingency table (continued) 


Probability Probability 
a a | 
0-05 0-025 0-01 0:005 0:05 0:025 | 0:01 0-005 
A=20 B=9 |13| 2.041 | 1-009 | 1-009 | 0 ом |A=20 B=6 | 14| 1-032 | 0 004 | 0 00 | 0004 
12 | 1-18 1 -018 0 -002 | 0 -002 13 | 0.007 0 -007 0 907 | — 
11 | 1.032 0 -005-/ 0 -005-! 0 -005- 12 | 0 -013 0 013 | — == 
10 | 0 -009 0 009 0 -009 | — 11 0 -022 0 022 | — == 
9 | 0.017 0 017 | — — 10 | 0 0357 | — -- <= 
8| 0.02 | — = = 
7|00-|—  |— - 5 |20| 303 | 2-004 | 2 04 | 2 0 
| 19 | 2-016 | 2-016 | 14002 | 1 00 
8 |20| 5 ол 5 017 | 4 -003 4 -003 18 | 2.038 1 005+ | 1 -005+ | 00% 
19 | 4 o5-| 4 -015-) 3.003 | 3 -003 17 | 15012 | 1-012 | 000 | 0 00! 
18 | 4 -038 | 3009 | 3.09 | 2 -002 16 | 1.023 1 023 0 ог | 00% 
17 | 3 022 3 -022 2 -005-| 2 -005- 15 | 1 -040 0 .005-| 0 05-| 0 95 
16 | 3 -044 2-011 | 1 -002 1 -002 14 | 0 -009 0 -009 0 00 | — 
15 | 2 022 | 2 -o22 1 -004 1 -004 13 | 0 ө5-| 0015 | — = 
14 | 2-40 1 -009 1 -009 0 -001 12 | 0 -024 0 -024 | — == 
13 | 1 016 1 :016 0 -002 0 002 11 | 0.038 | — — = 
12 | 1 -029 0 -004 0 -004 0 -004 
11 | 1-048 | Oo | 0 -oog | — 4 |20| 2-022 | 2:2 | 1 002 | 1 0% 
10 | 0-014 | O-o14 | — - 19| 1-008 | 1-0 | 1 0 | 0 99 
9| 0.024 | 0 024 | — = 18 | 1-018 | 10m | 0ш | 0 99 
$ Qr p == — 17 | 1озз+| 0-003 | О ооз | 0 0% 
16 | 0.07 | 0.07 | 007 | — 
7 |20| 4-012 | 4 02 | 3.00 | 3 002 15 | 0-012 | 0002 | — — 
19 | 4 -042 3 -009 3 -009 2 001 14 | 0 20 0 -020 | — re 
18 | 3 -024 3.024 | 2-05-| 2 .005- 13 | 0 on | — a 
17 | 30507 | 2 оп 1 -002 1 -002 12 | 0 7 | — — E 
16 | 2 -023 2 -023 1 -004 1 -004 | 
15 | 2-043 | 1-009 | 1-009 | 0 -oo1 , 8 001 | 0 99 
4 | tae | Loc | Om | бзш аа бш 
13 | 1») 0м | 0 ом | 0 004 18 | 0-006 | 0-6 | 006 | — 
12 | 104 | 0 007 | 0 007 | — 17 | 00и | боп | — =й 
П| 0013 | 0оз |— = 16 | 0-020 | 0 xs 
10| 002 | 0.02 |— = udi =i 
а | ул. | —. 15 | 0.032 |-- = 
E nm 14| 0:09 | — — == 
6 |20| 4.04 ' mi 
19| 3 bee 7 p. е cl sea 2 |20| 0-004 | оом | 04 | 07 
18 | 2-013 | 203 | 1 оо 1 ao БЕ 0 оз | 0 013 | — Е. 
17 | 2 ов 1 -004 Шен 1-55 і 026) | = — 2% 
16 | 1.010-| 1 о10- пас буюл 7| 093 | — = 
15 | 1018 1 18 0 -002 0 -002 
1 |20| 0.04 | — = = 
The table shows: 


(1) In bold type, for 
quoted (single-tail test). 
(2) In small type, 
or less than the inte; 


given A, B and a, 


К 
ње Је 
the value of b (<a) which is just significant at tho probability ] 
for given А, B and r= | 


a+b, t стр ж е 5 al 
ger shown in bold type. D probability (if there is independence) that b is eq" 
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A METHOD FOR JUDGING ALL CONTRASTS IN THE 
ANALYSIS OF VARIANCE* 


By HENRY SCHEFFÉ, Columbia University 


A simple answer is found for the following question which has plagued the practice of the analysis of 

variance: Under the usual assumptions, if the conventional F-test of the hypothesis И: ду = lg =... = t; 

nt the a level of significance rejects H, what further inferences are valid about the contrasts among the 

д; (beyond the inference that the values of the contrasts are not all zero)? Suppose the F-test has k—1 
. k 


and v degrees of freedom. For any сі, «++» Ck with X c; = 0 write 0 for the contrast У сд, and write 


stimates of 0 and the variance of 0. Then for tho totality of contrasts, no 
the probability is 1—2 that they all satisfy 
0—5865 «0 «0-- 863, 

where S? is (2-1) times the upper æ point of the F-distribution with k—1 and v degrees of freedom. 
Suppose we say that the estimated contrast @ is *significantly different from zero’ if | | > 506. Then 
the F-test rejects H if and only if some б аге significantly different from zero, and if it does, we сап say 
just which 6. More generally, the above inequality can be employed for all the contrasts with the 
obvious frequency interpretation about the proportion of experimentsin which all statements are correct. 
Relations are considered to an earlier method of Tukey using the Studentized range tables and valid 
in the special case where the fi; all have the same variance and all pairs 2, Й; (24-7) have the same 
covariance. Some results are obtained for the operating characteristic of the new method. The paper 
is organized so that the reader who wishes to learn the method and avoid the proofs may skip {2 and 5. 


6 and 85 for the usual е 
matter what the true values of the 0's, 


1, STATEMENT OF THE METHOD 
The general problem is that of making inferences about the contrasts among a set of ‘true 
n the analysis of variance. For example, the ji; 


Means’ or ‘true main effects’ д, las -+> Hr Ì 
might be the true row effects in a two-way lay-out with possibly unequal numbers of 
observations per cell. The д; may be unrestricted or subject to a single restriction of the form 


k 
Shims =h, (1) 
1 
к А _ 
Where the 7; and h are known constants with X 4+0. A contrast is a linear function of 
the His k | 
д= ус (2) 
1 
determined by k known constants 6; satisfying the condition 
k 
De; = 0. (3) 
1 


on for a particular set ови will be called the value ofthe contrast; 


The value of the linear functi 
llowing to use the same symbol 9 both for the contrast 


it will not cause any confusion in the fo 
and the value of the contrast. 

We make the assumptions usual in the a 
а set of statistics Ди, До» +: fi, and 82, sue 
tribution and are statistically independent of 02, that 

Ей) = (= Ши), 

and cov (fi; f) = 40% (i,j = 1, s E). % 
Where the constants @;у ате known, and c? is unknown. The д; will always be ds d 
(non-random) effects, as discussed by Eisenhart (1947) or Mood (1950). In a pure Мосе 


red by the Office of Naval Research (U.S.A.). 


nalysis of variance, namely, that there is at hand 
h that the 2; have a multivariate normal dis- 


ж Work sponso 
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situation, c? is the variance с? of a single observation (‘error variance’). In a mixed model 
situation where there exists an exact F-test of the hypothesis 


Н: ply = =... = щ, (5) 
о? will equal of plus further unknown non-negative parameters. In any case, 0° is an 
estimate of о? with v р.ғ. (degrees of freedom), that is, vê?/o? has the y? distribution with 
v D.F. The case where о? is known can be treated by obvious modifications of the theory 
below, usually merely by putting v = coand 8? = о? in the results. It is further assumed that 
if the д, are unrestricted the rank* of the covariance matrix with elements (4) is k, and that 
if the д; are subject to a restriction (1) then the Ê; are subject to the same restriction (1) and 
the rank of the covariance matrix is k— 1. 

The hypothesis H in (5), equivalent to the statement that all the contrasts are zero, can 
be tested by the conventional F-statistic with 2-1 and v p.r. We shall refer to this test 
at significance level о as ‘the’ F-test of H. The problem of making further inferences about 
the contrasts, arising when the F-test rejects H, has been considered by various writers, 
including R. A. Fisher (1935), D. Newman (1939), J. W. Tukey (1951), and Н. К. Nandi 
(1951). Except for Tukey’s and Nandi’s, the methods involve repeated tests of significance 
on the same data, and are hence subject to the usual objection that little is known about 
the joint operating characteristic. While it is often not possible in practical applications 
to avoid repeated tests of significance, it is possible for the particular problem we are 
considering. 

The solution studied in this paper is based on the following probability statement about 
the infinite totality of contrasts:t For any contrast (2) denote its estimate by д, 


У с;й, 
1 
the vari ) а У У 
е variance of Бу o2, оў = D aute; 0°, (6) 
+=1ј= 
and the estimate of this variance by 62, 
k k 
б) = X Xage;c6 
i=1j=1 
Define the positive constant S from 
8? = (k— 1) (2-1,2), (7) 


se rank conditions to be satisfied in 
later for the mathematical arguments 


8 of hypotheses suggested by tl et in 
ey on ‘New methods 1 -> Suggested by the data, I first m 
білі» не in the analysis of variance: Range in the numerator 


8”, an unpublished invited 
Mathematical Statistics and the Eastern 
March 1952 at Blacksburg, Va.; it differs 
ы = root-mean-square instead of a эн 
већа bes y . Communicated the metho sed on ( 
iely m pce to him and Prof. D. B, Duncan rni timor 
gave a lecture at Blacksburg in November 1951. 
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Where F (1 — 1, v) denotes the upper © point of the F-distribution with k— 1 and v D.F. Then 
the probability is 1 —c that the values 0 of all the contrasts simultaneously satisfy 


0—86;«0 «0-4 86;, (8) 


no matter what the values of all unknown parameters. The proof of this statement will be 
given in $2. 

This result may be used for the interval estimation of all contrasts of interest, including 
any suggested by the way the observed means fi; fall out. No matter how many contrasts 
are estimated by the method (8), the probability that all the statements thus made will be 
correct will be > 1—а. 
| The result may also be used to declare any estimated contrast ‘significantly different 
from zero’ or not, according as the corresponding interval (8) excludes 0 = 0 or not. More 
precisely, after selecting а set of coefficients с; subject to (3) and thus determining a contrast 
уге make one of the following three statements (it will be convenient also to say we make the 
Statement for the contrast as well ав about its estimate): 

(i) Ô is not significantly different from zero, 

(ii) O is significantly different from zero and positive, 

(iii) 0 is significantly different from zero and negative. 

We make statement (i) if —56 < 0 < 865, (ii) if 0> 565, (iii) if Ô< — 865. The operating 
characteristic of this method is studied in $4. 

We warn the reader here that in the special case where all the 2; have the same variance, 
and all pairs 7, 0 ; (ij) have the same covariance, and where the only contrasts of interest 
are the 1-1) differences ji; — ду, the method of Tukey described in §3 should be used in 
preference to the above, because the confidence intervals will then be shorter. An example 


of such a case may be found in Scheffé (1952). 


9. PROOF OF THE METHOD 
ade with the aid of a linear transformation and other mathematical 
5 for proving results about the operating charac- 
ients of the transformation will be regarded as 
d not be computed for the practical 


The proof will be m 
apparatus which will again be useful in § 
teristic stated in $4. Although the coeffie 
known in the mathematical discussion, they nee 


applications. Я 
The dimension of the space of estimated contrasts 0, regarded as linear forms in 


indeterminates /, is 0-1. Under the assumptions we made about the rank of the 
Covariance matrix (4), we may find a basis for this space such that the corresponding 
random variables 0, .... Îr- Will be statistically independent with equal variance 


2 2 
с? = о?, 


where С is some chosen positive constant. The choice of С for the purposes of 54 will be 


k 
discussed there; it does not matter at present. Let ў, = (жы rj ; Then between the //; and 


the 2, there will exist à non-singular linear relationship, 


k 
fxh (b= 1, (9) 
ігі 
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where the b;; are constants not depending on unknown parameters. Writing E (7) = 1). we 
d 
et from (9) Е . 
в “= У бат (= 1,...,Ю). 
гі 

k k k 
Then 0-УХош-У Убу. 

ізі i=1j=1 
The coefficient of 7, must be zero for all бі; 


езе, Satisfying (3), so б, does not depend on 
i, and we may write 


k-1 a Е-і m 
0= 710 = У ай, 
ј=1 1=1 


k 
where d; — Жабу (j = 1,...,4-1). 
i 
k- k-1 
Now оз = 229% = Cot Уа). (10) 


A contrast 0 for which o? = Са 


c? will be called а normalized contrast and denoted by 9. For 
a normalized contrast 


k к-1 
Ә-Хош- Уй, (11) 
1 1 
k-1 
we then have > di = 1. 


Clearly it suffices to prove that the probability statement associated with (8) holds for the 
totality of normalized contrasts. We в 


hall do this by means of some simple geometric 
considerations, 
Let us introduce a (&—1)-dimensional s 


pace, which we shall call the 
У = (Jy... Vy л), for graphing the paramet 


er point or vector N = (M, 
k-1 
fj, and other quantities, The random variables y; (1: — 17.) (Со?) and уб2|02 have inde- 
1 
pendent 4? distributions with k— 


1 and vp.r., reg 
quotient has the F-distribution. This yields the co 


k—1 
X (y; — 1) < 890, 
ігі 
for the parameter point 7, where 52 
parameter point 7) is covered Бу. 
Now anormalized contrast is uni ) 
) COH | rd = (d,, ..., dy-1 
of unit length, and it will be convenient in this secti A 


-with 
the vector. It is seen from ( e contrast wit 
Similarly, the value 


for 9 becomes 


y-space of points 
++ Тр), its estimate 


pectively, and go v/(k—1) times their 
nfidence sphere F, 


a2) 


is defined by (7). The Probability is 1— о, that the 


$806 «9$. 606 


— Аи 
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3. COMPARISON WITH A METHOD OF TUKEY 
In the special case where all the 2, have the same variance ао? and all pairs fli fi lij) 
have the same covariance a,56?, the following method of Tukey* is applicable. The prob- 
ability is 1 — а that the values 0 of all the contrasts simultaneously satisfy 


ó— T6 «0«0-- TG, (14) 
where the constant 7 is defined as ) 
ТА 
Т= 2X | e; | (ал — 922), (15) 


and 4 is the upper ж point of the Studentized range, for the range of a sample of / in the 
numerator, and р.ғ. in the denominator, that is, the upper « point of the quotient w/s, 
where w and s (=> 0) are statistically independent, w is the range of a random sample of 
k standard normal deviates, and vs? has the x? distribution with v D.F. This has been tabled 
by J. M. May (1952) for а = 0:05 and 0-01, and needs to be tabled for æ = 0-10. 

We propose to compare the efficiency of the two methods by use of the ratio R of the 
Squared lengths of the confidence intervals (8) and (14), 

R- (5263) (T202). 

The motivation for using the squared lengths is that if for a particular contrast the value 


of В thus defined is Ry, we may say that for large samples the method (8) requires / times 


as many measurements as (14) to give the same accuracy on this contrast. 


k 
After noting that оў = (а — 0:3) 7? X cz 


may express the ratio Ras 


atest practical interest are perhaps those consisting of the 
the и; and the average of r of the other и, (m. r < k); 
(m, т). For instance, a contrast of the type (2, 3} is 


(17) 


in the present case, we т 


The contrasts usually of the gre 
difference between the average of m of 
we shall symbolize this type of contrast by 


шин) – Ма + Ha + Ив). 


It may be shown that R attains its maximum value Rmax. for a contrast of the type {1, 1), 
that is, a difference of two д, and its minimum value Rin, for one of the type {3%, $H} if 
kis even, (46-1), КЕ + 1)} if k is odd, and hence 

= 2(8°/9°), 

4k-(S?|g?) for k even, 
Ruin. = = 1)-1 (82/4?) for k odd. 

w the relative efficiency of the two methods varies with k, the number of 
co. The rows headed 1/В тау, show the efficiency of method (8) relative 
i differences of two Jt; Rmin, shows the efficiency of 
e other contrasts. The value of S?/g? is also tabled 


Rmax. 


Table 1 shows ho 
means, in the case v = 
to method (14) on contrasts which are 
method (14) relative to method (8) оп som 

* See footnote on p. 88. 
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for use with (16) for the calculation of R. Table 2 shows that the value of R is ng mes 
sensitive to v. Of course any increase of S?/g? above its value for v = co listed in Table 
favours method (14). 


Table 1. Variation of relative efficiency R of two methods for different contrasts (v = oo) 


| For k= | 
a Value T — É— — M 
MES 21 sit tel 6 |e baa | a | 16 20 
| | | | 
| | | | 
0-10 | S*/g* | 0:50 | 0-55 | 0-60 | 0-64 | 0-69 | 0-78 | 0-86 | 0-98 | 1.09 1-23 
E, 1-00 | 0-91 | 0-84 | 0-78 | 0-73 | 0-64 0:58 | 0-51 | 046 0-40 
Rain, | 1:00 | 0-82 | 0-60 0-54 0-46 | 0-39 | 034 0-30 | 0-27 0:25 
”: | 
0:05 | 52/2 | 0-50 | 0-54 | 0-59 | 0-64 | 0-68 | 0-76 | 0-85 | 0-96 | 1-07 | 1-20 | 
Џул. | 1:00 | 0-92 | 0-84 | 0-79 | 0:73 | 065 | 0-59 | 0-52 | 0-47 | 0-42 | 
y 1-00 | 0-82 | 0-59 | 0553 | 0-45 | 0-38 | 0-34 | 0-20 | 0-27 | 0:24 | 
4 | 
| | 
0-01 | 8%4% | 0-50 | 0-54 | 0-59 | 0-63 | 0°67 | 074 | 0-81 | 0-92 | 1:01 | 1-14 
1, | 1:00 | 0-92 | 0-85 | 0-80 0.75 | 0-67 | 0.61 | 055 | 0-49 | 0-44 | 
Еш. 1:00 | 0-81 | 0-59 | 0:52 | 0-44 | 0-37 | 033 | 0-28 | 0-25 | 0-23 
| | | | | m | 2 
Table 2. Values of 8949, showing dependence of relative efficiency on v 
| 
ТА | | 
т & 3 6 10 13 16 20 | 
: | 
0-05 5 0-56 0-70 0-89 1-03 1-15 1:31 
т 0:55 0-69 0-87 1-00 1-13 1-28 
10 0-55 0-69 0-87 0-99 111 1-26 
20 0-55 0-68 0-86 0-98 1-09 1:25 
40 0-55 0-68 0-85 0-97 1-08 1.23 
© 0-54 0-68 0-85 0-96 1-07 1-20 | 
0-01 5 0-57 0-72 0-91 1-05 1:18 1-34 
7 0-55 0-69 0:87 1-00 1:19 1.27 | 
10 0-55 0-68 0-85 0-98 1-09 1-24 
20 0-55 0-68 0-84 0-96 1-07 1-21 
40 0:54 0-67 0-83 0-94 1-04 1:18 
© 0-54 0-67 0-81 0-92 1-01 1:14 
The difference be 


ethods is seen to increase wi 

#4— и, While 
contrasts (and gets worse faster). It is clea 
Special ease where (14) i 


interest,* (It is 


x increas 8) gets 
the differences th k; as k increases (8) g 


(14) gets relatively worse on some other 


the confidence E 
could be constructed w 


information that the 
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one with the results we like better—unless we are willing to settle for an overall confidence 
coefficient known only to be > 1— 22, in which case we may choose for each contrast the 
shorter of the two intervals.) If we are interested exclusively in the differences и,— 4j, we 
should choose (14); if we are interested in many types of contrasts, and in investigating 
contrasts suggested by the data, (8) would seem superior. Tables 3a and 3b show the 
relative efficiencies in the cases k = 4 and k = 6 on most contrasts likely to be of practical 


Table 3a. Relative efficiency of two methods Table 3b. Relative efficiency when 
when k = 4 (æ = 0:05, v = оо) k = 6 (а = 0:05, v = со) 
= | | | 
| | | Туре of 
| Type of contrast | 1/R R | contrast 1/R R 
| (1, 1) | 0-84 11,1) 0-73 
{1,2} | 0-89 {1, 2} 0-98 
{1,3} | | 0-79 (1, 3} 0-91 
(2, 2), quadratic | 0:59 | 
Linear, cubic | 0-74 à {1,4} 0-85 
| {1, 5} 0-82 
{2, 2} 0:68 
12, 3} 0:57 
{2, 4} 0-51 
(3.3) 0-45 
Linear 0:59 
, Quadratic 0:57 
Cubie 0-48 
Quartic 0-53 
Quintic 0:67 


interest.* The type {m,r} is described above (17). The contrasts for linear, quadratic, еїс., 
effects are the contrasts used in fitting orthogonal polynomials when the /4; correspond to 
equal steps of some independent variable. The coefficients с, for this type of contrast are 
listed in Fisher & Yates’s tables (1943). The relative efficiency of method (14) is given in the 
column headed R, of (8) in the column 1/R. It is seen that for k = 4, method (14) is superior 


only on the type (1, 1}, for k = 6 also the type {1, 2}. These tables are for у = оо and require 
some correction in favour of (14) for small values of у; the correction factor is the ratio of 


entries in Table 2 for the given у and for у = 00 аба = 0-05. 

obtained by a uniform contraction of # about the centre of 
would be a hopelessly complicated calculation for most 
equal variances and equal covariances, and the contrasts 
of interest are the 2( — 1) differences /^; — Its Tukey's method 9688 calculate thek”. EH” is available, 
we can of courso infer about other contrasts by projecting Ж' on their vectors d, as we projected A i 
This is equivalent to (14) for Tukey's method. and we will see below that this does not more as e 
for many contrasts of interest. We suggest that it will often be worthwhile to the n pay o 
more than he will use if he does not know beforehand just what he will need and can buy a goo 


Package. 


principle it is possible to calculate an ж ' ob 
< and to pay just for this; in practice this 
cases. In the special case where the //; have 


* Tán ti dho ntrasts. For example, if и; is the true mean for the ($, 7) 
cell in еж = yos the two-factor interaction in the (1, 2) cell is дуз — 1. — H.a +... where the 
dots have their капа! одавао: and this is a contrast among the д). General superiority of (8) over 
(14) for the interaction contrasts is indicated by some numerical calculations. 


be regarded as со: 
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4. OPERATING CHARACTERISTIC OF THE METHOD 


For many purposes it may suffice to decide whether an experiment has adequate m 
by considering the lengths of the confidence intervals (8). In this section we consider the 
effects of the method from a viewpoint close to the Neyman-Pearson (1933) concept of the 
two kinds of error possible in hypothesis-testing. We preface this rather long development 
with a few remarks. . | 
The power of the method will turn out to seem low to one accustomed to power calculations 
for a single t-test.* This has its counterpart in estimation, in that for Ё > 2 our confidence 
interval (8) for a contrast will usually be much longer than the 100(1— 2) % confidence 
interval for this contrast alone based on the t-distribution (which it is, of course, not valid 


Table 4. Values of 125° 


Ep S | ЕЕЕ = | ao |) те | 38 | 20 
a v 
O10 | 5 | 1-00 | 0:54 | 0-87 | 0-29 | 0-24 | 0-17 | 0-14 | 10 | оов | 0:07 
7 | 1:00 | 0-55 | 0-39 | 0-30 | 0-25 | 0-18 | 0-15 | 0-11 | 0.09 | 0.07 
10 | 100 | 0-56 | 0-40 | 0-32 | 0-26 | 0-19 | 0-16 | 0-12 | 0.10 | 9,05 
20 | 100 | 0:57 | 0-42 | 0-33 | 0-28 | 021 | 0:17 | 0-13 | oat | 0:05 
40 | 100 | 058 | 042 | 0-34 | 0-28 | 0-22 | 018 | 0.14 | 0-11 | 000 
9 | 100 | 0-59 | 0-43 | 0-35 | 0-29 | 0.23 | 0-18 | 0-18 | 0.12 | 10 
005) 5 | 1-00 | 0-57 | 0-41 | 0-32 | 0-26 | 019 | 015 | 0-12 | 0-10 | 0-08 
o | 1:00 | C59 043 | 034 | 028 | 021 |017 | олз| ore | 098 
20 | 1:00 | 041 | 045 | 0-36 030 | 0-28 | ола | 014 | Cod 0-09 
20 1:00 | 0-62 | 0-47 0-38 | 0-32 | 0.95 0:20 | 0-16 | 0-13 | 0-11 
40 1-00 | 0-63 | 0-48 0-39 | 0-33 | 0.26 0:21 | 0-17 | 0-14 0-12 
% | 100 | 0-64 | 0-49 | 0-40 | 0:35 | 0-27 | 0:23 | 017 0:15 | 0-13 
90411 5 |100 | 0-61 | 0-45 | 0-36 | 0-30 | 0-2 0-18 | 0-14 | 0. 09 
7 | 100 | 064 | 0-48 | 0-39 | 0:33 | 0-25 | 0:20 016 913 0-10 
10 | 100 | 0:66 | 0-51 | 0-42 | 0-36 | 0-98 | 0.25 0-18 | 0-15 | 0-12 
20 | 100 | 0-69 | 055 | 0-46 | 0-39 | 0.31 | 023 0:21 | 018 | 0-14 
40 |140 | 071 | 057 | 048 | 042 | 0.33 | one 0-23 | 0-19 | 0-16 
9 | 100 | 0-72 | 0-58 | 0-50 | 0-44 | 0.36 | 0.31 0-25 | 0-22 | 0-18 


for a 95 % confidence no к А 95 % confidence that all are correct, а8 


he contrasts selected before the data are 
aving the same expected length. 


vals in connexion with Tables 5 and 


3 — a = 
—M 
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Instead of considering the present method from the point of view of one accustomed to 
the method of repeated t-tests, or repeated confidence intervals based on the same data, it 
is instructive also to do the reverse. The theory of the method (8) permits the calculation 
of a lower bound for the overall confidence coefficient implied by the use of repeated 
t confidence intervals for the contrasts, calculated from the same data. Thus, for repeated 
95 % confidence intervals based оп ¢ we may calculate the bound by substituting in place 
of S in (7) the two-tailed 5 % point of ¢ with v D.F., and solving the resulting equation for 
1-о. The values of this bound shown in Table 5 may startle the reader when he considers 
that by increasing the number of repeated t confidence intervals, the overall probability 


Table 5. Lower bound for probability that all repeated t confidence intervals for 
contrasts will be correct (v = oo) 


coeff. 
0-90 0-90 | 0:74 | 0-56 | 0:39 | 0-25 | 0:09 | 0:03 | 0-003 0-082 |<0-0'1 
0:95 0-95 | 0-85 | 0-72 | 0:57 | 0-43 | 0:20 | 0-08 | 0:01 | 00027 0-048 
0-99 0:99 | 0:96 | 0:92 | 0:84 | 0-75 | 0-53 | 0:32 | 0-12 ! 0:03 | 0-004 


Table 6. Lower bound in certain cases where repeated t confidence intervals are 
used only on differences pt; — из (v = 00) 


Conf. 
coeff. 
р .77 | 0-65 | 0:53 | 0-43 | 0:28 0-17 | 0:08 0:04 0:01 
0.98 d В 0:80 | 0-71 | 0:63 | 0-49 | 0:37 | 0-24 0-15 | 0-08 
0-99 | 0:99 | 0:97 | 0:95 | 0:93 | 0:90 | 0:84 | 0:77 | 0-67 | 0:58 | 0:48 


ound. If repeated ¢ confidence intervals are used 
only on the differences ја; — у, and if the 2, have equal variances and equal covariances, then 
the theory of Tukey’s met: т bound, found by solving for 1—« the 
equation д = 2.5, where 4 is the same function of æ, k, v as in (15). This bound is attained 
if all (0-1) statements are ma 
Shown in Table 6. We may try to t 
žk(Ł—1) statements based on 5 96 m 
of all the statements we made correct, but we 5 
which we rk likely to pay the most attention more than 5 % tend to be wrong; for example, 
Statement: iated with the larger observed differences. 

The оеш of the new method is exactly the same as that of the F-test of the hypo- 
thesis H in (5) at significance level a in the following sense: If the F-test accepts the hypo- 
thesis Æ that all the contrasts are zero, the new method will make statement (i) for all 


Contrasts, that is, say по contrast is significantly different from zero; if the F-test rejects 


4 
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the hypothesis, the new method will make statements (ii) and (iii) for some тате 
is, say they are significantly different from zero (and positive or падан» е, respec y) 
This may readily be seen* from the geometrical picture introduced in §2. — 
To formulate the problem of the power of the method, we consider the probabilities a 
making each of the statements (i), (ii) and (iii). We might think at first that what we e d 
like is to make statement (i) for a contrast whose true value is zero, (ii) for one whose tr к. 
value is greater than some assigned 0,, (iii) for one whose true value is less than = 0, anc 
that what we want are the probabilities of the desired statements in each case. This formula- 
tion is at once seen to be nonsensical, because for any contrast 0, whose true у 
there exist constants h’ and л” such that the true value of 0, is «0, 
А0, is > 0s, and yet the same one of the statements (i), (ii) and (iii) 
contrasts 10, with positive h. This difficulty can be 
the suitably normalized contrasts; if we find what 
we will know what happens for all contr: 
is normalized if the variance of its estimate is C?g?. where С is 
normalized contrasts and their estimates we then write 9 


To see how the above difficulty is met, suppose the struct: 


the contrast jj — H is determined with greater precision t 


а property of the design which presumably was desired 
probability P of detecting (that is, making statement (ii) 
as бр, we should be satisfied with the 

great as a certain multiple of 0o. The 
seen to be the ratio of the standar 
property may be expressed by sayi 
detecting all normalized contr 
а way of normalizing the con 


alue is positive 
о, the true value of 
will be made for all 
avoided by laying our requirements on 
happens for the normalized contrasts, 
asts. The appropriate definition is that a contrast 
a specified constant; for 
and 9 instead of 0 and 0. 

ure of an experiment is such that 
han the contrast jt, — Us. This ін 
‚ and if we guarantee a certain 
for) a difference [t — fly 28 great 
same probability P of detecting a difference Jta = Hs 8 
multiple assigned by the present method will later be 
d errors of estimate of the two contrasts. Since this 
ng that *the method assures the same probability P of 


2565 as great as a certain bound 9,’, it indicates that we have 


| | trasts which is suitable for this method. 
While this motivates our normalizing the contrasts so th 


қ at their estimates all have the 
Same variance (202, there is still the question of the choice of the constant C, Whether there 
18 a satisfactory universal choice of С is dubious, + In any event, the question is of little 
practical importance, and we need not settle it in order to analyse the operating charac 
teristic. If for the same experiment two statisticians choose C and С”, then equivalent 
choices of the bounds 9 and 9; are related by 9, = (0' ІС) 9. | 
А simple example illustrating these considerations will 
pric v experiment in а“ one-way lay-out’ with k = 6, th 
ervat 

е bissl cd Tues ч s f aS ОП /44, бб, lg, and that we are prim 
%-ш (for equal numbers of Observa: 


$2, H may be written = 
covers the origin, and this is preci 


be helpful. Suppose we wish 10 
at we wish to take twice as many 
arily interested 1? 

tions per mean ya 
* In the notation of 


fidence Sphere £ 0. Thus the P- 


( test accepts Н if and only if the сой” 
isely th 


1 
e case where statement (i) will be made abo" 


the following: б fr, to É E 
transformat, 1 Owing: transform from ñ, ..., A. to бе? 

pendent. The set P. В ай that B, ig «ће ls of 82, and Ё 5 f nd tigally ve 

бања > 5k-1 then spans the Space of t} inpia Па шү 

7 15 the geometric mean of а a ет 

be shown that in the с: i Me 


е 
ts. Let o? = var (£;). рейт! 


the aj; т (6) alone. № m 
gives C = n- ; more genere 


; 7 k Ge 

т, this gives Сел) =a fi 7, where 7 = Б nil 
b 

» Which does not particular]: : 


ra C ist 


= Gn, 
ew. 
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should then use Tukey's method). These contrasts will then be determined with three kinds 
of precision, namely, the precision of those like //, — иу. like да — из, and like Из — Ша. the three 
Standard errors of estimate being in the ratio 2! : 4! : 31, Suppose we are satisfied with a 
probability of P = 0-90 of detecting a difference д; — дз as great as 1-00. We will then get 
the same probability 0-90 fora difference и; — изаз great as 1-00 (4?/2}) = 1-40, ог a difference 
/^5 — [ty as great as 1-0¢(3!/2!) = 1-20. If for either of the last two differences this sensitivity 
is not considered sufficient then we should question whether we have chosen the correct 
design. 

Let x be the number of observations on у, б. e and 2» the number on д. д5, Из. While 
the choice of the normalizing constant C does not matter much and might even be left 
indefinite, we shall suppose for concreteness in this example that the contrasts are normalized 
so that their estimates all have the same variance as that of ду — д5, namely, озјт. Since 


for any contrast 


k 
the coefficients in a normalized contrast = У c;//; must then satisfy the condition 
1 


13 6 

За ais 
->а-Хсі- 1. 
21 1 


Thus the normalized form of the contrast да — Из is 275 (t4 — 45), and of t — jt. (8)* (1 — и). 
Below we shall find how to choose % so that the probability is 0-90 of detecting a difference 
of ji — и, equal to a given multiple of ø, say 1:00 = 95. The probability will then be the same 
for the value 8, of the contrast 2-1 (да = 15) or the value % of the contrast (5) (= д). 
This gives the statements in the last paragraph about the differences да — д5 and ја — jt. 
In such an experiment the contrast consisting of the difference of the average of ду, Ho, Jt 


and the average of у, б, из might also be of interest. Its normalized form is 
o 4 5 


6 


3 
($)! (Z.- X). 
1 4 


3 6 
and we find in a similar way that a difference of the averages У [3 and Y ji;[3 equal to 
5 2 - 
detected with probability 0-90. 

In the following calculations we shall symbolize by Ғ(/; д) a non-central ¢ variable with 
Јр, and non-centrality parameter 9, that is, a variable distributed as the quotient of 2 + д 
by f — 1, where zis a standard normal deviate and y is an independent x variable with f D.F. 

Statements (i), (ii) or (iii) are made for a contrast 0 according as 0/6; falls in the intervals 
(-5,5) (8 со) "t (—со, — 8). But the variable 0/8; has the non-central ¢-distribution with 
vo.. and parameter 6 = 0/9, where ср is given by (6). It follows that the probability of 
each of these statements being made depends only on the true value of 6/9 for the particular 
Contrast for which the statement is made, and on the constant 5. | Е 

If the true value of 0 is zero, the probability that we make the desired statement (i) is 

Pr(—S«'(r: 0)< S]. 
able with r n.r. This may also be written 


($) (8)! 8, = 0-70 would also be 


where ('(v; 0) denotes а central / vari 
Pri, v) <(k= 1) КЕ 1v). 
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here F(1, v) isan F variable with 1 and v D.F. That this is at least 1 — æ is evident gi 
bo | 2 x Б « Ж robs nu 
confidence statement associated with (5). Indeed, as 4 increases from 2, this probability 


increases rapidly from 1 — а as indicated in Table 7, where 1 minus this probability is tabled 
for the case v — oc. 


Table 7. Probability of not making statement (i) about a contrast whose 
true value is zero (у = оо) 


| k 2 3 | 4 Б] 6 8 | 10 13 16 20 | 
a | | | | 
d 


= | | = | | 
0:10 | 0-10 | 0:032 | 0-012 | 0-0053  0-0024 | 0-0353 | 0:0313 | 0-017 | 0-0523 0018 
0:05 | 0-050 | 0-014 | 0-0052 | 0-0021 | 0-0388 | 0-0%18 | 0-0%39 | 0.0545 | 0.0557 | ШО 
0-01 | 0-010 | 0:0024 | 0:0876 | 0-0327 | 0:0310 | 0-0117 | 0:0532 | 0-0931 | 0:0732 | 0.0818 | 
| | | 


For any contrast 0 the probability that we make statement (ii) is 


Pr (у; 0]o,) > S). (18) 
This is a strictly increasing function of the у 
normalized; we then write it as 9. 
9 = 99, we have 


alue of 0ја;. Suppose now the contrast is 
and have 25 = Со. Writing P for the probability (18) when 
P = Pr{t'(v; 99/(Co)) > S). (19) 
From the tables of non-central / | 


»y Johnson & Welch (1940) we сап find the value of 
8-% (бо) for which P attains the values 0-01, 0-05, 0-1 (0-1) 0-9, 0-95, 0-99. . 


The probability of making statement, (iii) for any contrast can of course be calculated as 
the probability of making statement (ii) for its negative, 


In the example introduced above of the experiment with 6 means, C =n., If we 
take n = 10 and use significance level æ = 0-05, then у = 84, g2 — БЕ, " 84) = (3-41), 
and we find from (19) and the Johnson & Welch tables that for P = 0-90 0 = 4:72, 80 
Әу = 47200 = 1-495. This poor sensitivity is related to the extremely low тікі (0-0009 іп 
Table 7 for у = оо) of making the other kind of error, namely, calling significantly different 
from zero the estimate of à contrast whose true value is zero, We su Ы а is this example 
that we desired the n that would give a P 8 rete E 9. біпсе for 
large у the parameter 8 found from the tab ch “ith vie for a first 
approximation for n the solution of до (Се) = (1-0) nt = el cy An 


5 У жы = 6 with the fi = 472, This 
же n P ualde of 9, from the tables for n = 1, 22, as t cedi 10, shows 
at n = 22 is the required value. In the same wa; d th F 
3 X y we find + i = 2-00 for 
Р = 0-90, a first approximation to the required n is n = a wie 
also gives n = 


2 " ly: 
2 that the desired stat values are zero, and second 


; made for all normali?? 
) “ited statement (iii) would also be made f? 
o). 


«ы 
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We shall state here the results for the probabilities Р, and P, and defer the proofs to the 
next section. It is obvious from ($) that if the hypothesis Н in (5) is true, P, = 1—а. If 


H is false 
P, = Pr(F(E-2,v) & (Ё— 1) (k— 2) Еф 1,9), (20) 


where F'(m, v) denotes an F variable with m and v D.F., and Е, (т, v), its upper « point. It is 
Interesting to note that P, depends only on whether H is true or Die and not further ж men 
unknown parameters. It is easy to see that if H is false P, > 1 — а. Table 8 gives some values 
of 1 — P; we remark that for k = 2 there are no zero contrasts other than 0-4, + 0: i if H is 
false, and so 1 —P, — 0 if H is false, trivially. We see now that this analogue of the 
Neyman-Pearson risk of a ‘type I’ error, namely, the probability 1 — Р, of failing to make 
statement (i) for all zero contrasts, is the one the method controls in a satisfactory way 
rather than the marginal probability for a single zero contrast considered above, which 
assumes the microscopic values indicated by Table 7. 


Table 8. Values of 1 — P, if H is false 


| I 


D) 3 4 5 6 8 10 їз | 16 20 
a у | 


0-10 5 | 0-040 | 0-056 | 0-065 | 0-070 | 0-077 | 0-082 | 0-086 | 0-088 | 0-091 
7 | 0:038 | 0-053 | 0-061 | 0-067 | 0-074 | 0-079 | 0-083 |0086 | 0:089 
10 | 0-036 | 0:050 | 0-058 | 0-064 | 0-071 | 0-076 | 0-081 | 0-084 | 0-087 
20 | 0-034 | 0-047 | 0-055 | 0-060 | 0067 | 0-072 | 0-077 0-080 | 0-083 
40 0:033 0-046 0-053 0-058 0-065 0-069 0-074 0-077 0-080 
го 0-032 | 0-044 | 0-051 | 0:055 | 0-062 | 0-066 | 0-070 | 0078 | 0-075 


0-05 5 | 0-019 | 0-027 | 0-031 | 0-034 | 0-038 | 0-040 | 0-042 |0044 | 0.045 
0-018 | 0-025 | 0-029 | 0-032 | 0-036 | 0-039 | 0-041 0-042 | 0:044 


7 
10 0-017 0:024 0:028 0-031 0-035 0-037 0-040 0-041 0:043 
20 0-016 0-022 0-026 0-028 0-032 0-035 0-037 0-039 0-041 


0:025 0:027 0:031 0-033 0:035 0:037 0:039 


40 | 0015 | 0-021 
0-029 | 0031 | 0:033 | 0035 | 0-036 


со 0-014 | 0-020 | 0-023 | 0-026 


0-01 | 5 | 0-0036 | 0-0051 | 0-0060 | 0:0067 | 0-0074 | 0-0079 | 0-0084 | 0:0087 | 0:0089 
7 | 0.0033 | 0-0047 | 0-0056 | 0-0062 | 0:0070 | 0:0075 | 0-0080 | 0-0083 | 0-0086 
0.0030 | 0-0044 | 0-0052 | 0:0058 | 0-0066 | 0-0071 | 0-0076 | 0:0080 | 0:0083 
20 | 0-0027 | 0-0039 | 0:0047 0:0052 | 0-0060 | 0-0065 | 0:0070 | 0-0074 | 0-0078 

0-0044 | 0-0049 | 0-0056 | 0-0061 | 0-0066 | 0-0070 | 0:0074 


40 | 0-0026 | 0-0037 
со | 0.0024 | 0-0034 | 0-0041 | 0-0045 | 0:0052 | 0-0056 | 0:0061 | 0:0063 | 0:0067 
be 


es of all the normalized contrasts. If 9) > max. 


Let 94. denote the largest of the true valu 
define the angle y from 


the probability P, is trivially 1. If 99 Pmax. 
siny = Sol? max (21) 
is the probability that 


Qa) 9, cosec y, (22) 


Then it will be shown that in this case Р, 
(сову) да (Sy 0800 у) Xe < 2 + ( 


Where у, уз, 2, are statistically independent, y; and у» are y variables with k—2 and v D.F., 

Tespectively, and 2; is a standard normal deviate. The probability P, does not seem exactly 

calculable from any existing tables, but an excellent approximation* may be obtained for 
* Johnson & Welch (1940) used a similar method to approximate non-central 2. 

7-2 
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У йе E 
moderate or large values of v by replacing v-*y, by a normal variable z, with mean 1 үлі 
1 1 " D «Бала va er 

variance 1/(2v). A similar approximation would not work so well for Ха» since its a t 
H ^ р 7 
k — 2 0f n.r. might be quite small. This way we find P approximately equal to the probability 


that 202 (E —2)- ул, 
where 2 = [1 + (20) 1 А cosec? y]-* (=, — 2, cosec у + 5 созес y) 
is a standard normal deviate and independent of Аз; 


д = [1 + (2v)7! S? совес y]-! [(Co)-! 9, cosec у — 8 совес y]. (23) 
ty = [1+ (20)! S? cosec? y]-! (k — 2)! coty. (24) 

To this approximation P, can thus be expressed in terms of the non-central t-distribution as 
Pr{t'(k—2; 8) » 49), (95) 


where д and t, are given by (23) and (24). In the ease where ø is known we can put у = 09 
in (22), so that the left member becomes (cot y) Ху + 8 cosee y; we may also put v = co in 
(23) and (24), and the above approximation for P, then becomes exact. | 

The exact probability P, and its approximation (25) depend on the parameter 9 пах, which 
enters (22) through y. As indicated in the example of the six means, we may be willing to 
specify 2, beforehand as a multiple of с, say 


ду = Ao, (26) 


where A is a given constant, but it is unpleasant to discove 


r then that P, still depends on 
the unknown parameter ‹ 


y => шах/0. (27) 


The dependence of the exact probability P, of (22) on у may be made clear by writing (22) ая 


(PA= + ЗУ А) xs & а + OA, (28) 
where the variables уу, Хо. 2; have the distribution stated below (22). For y > А, D, is the 
probability of (28); for 0 « V € A. P, trivially equals 1. 
obtained in (19) the corresponding marginal probability for a single contrast alone whose 
value = 9, (the non-centrality parameter 6 there may be written A/C), as it does now when 
we consider the overall probability for all contrasts whose valueis >9 (we could write = 90 
here also). 


We suggest meeting this problem by using a low 
Р, = Р,(ү) is monotone in у, decre 
P,(co) as yr increases from 0 to œ. Т 
exceed a known ү, we may use as t 
approximated by (23), (24), (25) with y = aresin (Alp), 
a bound y, for V. or if we are satisfied with | 
calculated bound, we may use Р, 


aie 2 = P(o), 
The limiting value Р,(оо) is the probabili 


This problem did not arise when we 


er bound P; for Р,. We shall prove that 
asing from the value 1 for « A 
his means that if we are w 


he bound P; the value РА 


to a limiting value 
illing to assume y does not 
1), Which may be accurately 
and if we are unwilling to 800 
à somewhat larger but much more easily 
whose value will now be stated. 

ty that 


иу < ДС, (29) 
where y, and y, are distributed as in 


(22). This may ђе асс i " 

В а а ‘ately approx the ват 
method that led to (25), with the result that, P(o) is шы и = 
Ре (2-9; ууру 5o) 
* Itis unfortunate in connexion with (30) and (25) b Е | 


go below 4 D.F., since 1, 2, at the Johnson &W 


t 
Sm. gre sd fori elch tables (1940) do n9" 
extension of their tables. oe needed for jos 3, 4, 5; it would be А 


ren А ге thie 
very desirable to havé 
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where à = (20): [A(CS)-1 — 1], (31) 

ty = (2v (E — 24/8. | (32) 
For very large values of v we may use the following simple approximation to Р.(оо), obtained 
by replacing р-у, by 1 in (29): 

Piriya AC? -S). 
To this approximation we may thus rapidly find for given 2 the A such that Р,(оо) attains 
the value 1 — for 9, = Ac by taking the square root ofthe upper £ point from the X? tables 
for k—9 D.F., SAY Хр and computing 

А = С(8 + Xy;1-2)- (88) 
A still rougher approximation might be obtained by replacing S by its value for v = оо, 
namely, y..,. ,, to give 

А = C(Xs:k-a + Xpik-2)- (34) 


The approximation (34) becomes exact when c is known; this is obtained by replacing & by 


9 and v by oo in the calculations. 

То illustrate these results in the example we have carried along, we recall k = 6, C = n=, 
a bound Р; for P, and decide to use P, = Рдоо). If we take 
n = 10, then у = 84, and entering the Johnson & Welch tables (1940) with 10 = 7:61 from 
(32) we find that a P,(oo) = 0:90 (to the approximation of (30)) is attained for à — 10-8, and 
this gives A = 1-98 From (31), that is, for 9, = 1-980. The approximation (33) much more 
quickly gives А = 6:206 = G-20n— or A = 1-96. The approximation (34) gives А = 1:93. 
То find the n which gives а Р.(60) = 0:90 for ду = 1:00. we first use the previous approxima- 
tion from (83), A = 6-20n-*, which is calculated with an S based on the now wrong v = 84, 
but then S is not sensitive to changes in > for large v. This gives n = 38 for A = 1:0; the more 
Correct formula (30) also gives n = 38. For most applications we would probably have to 
Compromise on a larger A to get a more feasible n. In a similar way for бла б, Wm 39, 
о = (-05, С = 5-і, we find first S = 3:50, then A = 2-81 from the approximation (33); the 


More accurate (30) gives A = 2:87. 


% = 0-05. Suppose we wish 


5. DERIVATION OF А, Py, ETC. 


The probabilities P, and Р, can be found neatly by continuing the geometrie approach 
9f $2. Extending ‘the notation there, denote by || the length of the vector 7, 
á x o 


ly | == Since the value # of а normalized contrast is the projection of оп d, it is 
2. 


. 1 
evident that [7] zx d csse (35) 


L0 (equivalent to H false). If we imagine marking off the 
to get a polar co-ordinate graph of the totality of 
Values of the normalized contrasts, We see the graph consists of a sphere which has the 
Vector у as a diameter. The zero-valued contrasts for which we desire to make py pnm 
a dia oa > : P 
(i) constitute the (I — 2)-dimensional set of vectors d in the tangent plane P to the 
а А is pictur in an obvious way. 
Sphere at the origin. If = 0 this picture collapses in an о | 
av origin. 1] uh бі 5 4 
Consider now is totality of normalized contrasts whose values are > % , 2 which = 
Wish to make the stateméttt (ii). There are none if |n | «9. If |7 |> до they fi г 2. 
Sone % (by ‘cone’ we mean throughout one парре of a cone) with axis along 7, vertex 


Suppose for the moment 73 
Projection of jj on d for each unit vector d 
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origin, and elements making an angle 37—y with 7, where y is defined by (22). The cone 
©’ obtained by reflecting € in the origin is 
are € — 90. 

In the same y-space the estimated value 


filled by the normalized contrasts whose values 


s of the normalized contrasts have a similar 
graph, the diameter of the sphere being 7. From the interpretation of the confidence 
interval (13) for the value 9 of any normalized contrast das the projection of the sphere S on 
d, it is easy to see that if || < 806 the statement (i) is made for 
| 7 | > 806 the normalized contr: 
axis along ў, vertex at the ori 
7, and the normalized contrast 
of Z in the origin. 

If 5-0, the event 6, of probability P, 
F on the plane F covers the o 
so that the vector 7 lies 


all the contrasts, while if 
asts for which we make the statement (ii) fill a cone 2 with 
gin, and elements making an angle arccos (SCG/| 7 |) aie 
8 for which we make the statement (iii) fill the reflexion 7 


happens if and only if the projection of the sphere 
rigin. It is convenient now to rotate the axes in the y-space 


along the positive y,_,-axis. Denote the new coordinates of 7 
~ ~ 
by б“ 5 бер and of 7 by £,,.. 


^ ^ 

“быр respectively. Then Ers ба, 62 will be independent, 
the ¢ will be normal with variance C2¢2. the means ¢,...,€,-» will all be zero, while 
Са = |7|. The plane # now 


has the equation Ур 
$ 


K-2, NE 
FP from the Yr-1-Axis is ( ру 8) 
1 


only if this distance does not 


-1 = 0. The distance of the centre of 
‚ and so the projection of. on 2 will cover the origin if and 


exceed the radius of Ж. 
k-2, 
X Besos, 
k~2, Ы 
Py Ci/(k—2) 


or 


atter inequality has the F-distribution 
riving (20). 


7] an angle y defined by (21) 
of its centre from the y, |-ахін 


е radius of the в here equals 6, tan Y- 
Thus the event 4» happens if and only if и d d 
CEA 3 А 
( X 8) +080 sec y < была tan y. 
If we now divide this inequa] y Ч 
in connexion with (22). quality through by Co tan У we get the desired result for P, state 


The monotone behaviour of Ру i 
Picture of the sphere P and the | Pria ee 
Tt will be simpler to give here an anal ем 
Јоу) the conditional Probability, 
suffice to show that fo: 1 
since совес у = VIA 


£gested by the above geometri? 
blished by geometric arguments 
g the inequality (22). Denote Ыы 
quality is satisfied. Clearly it W r 
reasing function of ү for y > 4,? 


where № = tany +B 


See y, (36) 
В = 40-1 i 


=S, 
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Because of the statistical independence of үү. д». 21. the conditional probability f(y) may be 
found by treating y, as a constant and working with the joint distribution of ү, and 2, in the 
21. Xi plane: f(y) is the amount of probability in this plane above the z,-axis and below the 
line (36). We shall drop the subscripts from y, and z, in the rest of this calculation. The 
?-intercept of the line (36) is — B cosee у, and the probability f(y) may thus be expressed as 
the integral 


Ју) = [ 


© 
JE 


py(z) 0(2, у) dz, 
соѕес y 


P(X) d'y. 


ztany+Bsecy 
where g(z.y) = 
(5,7 


Јо 
and p,(z) and Pe(X) are the densities of a standard normal deviate and of a у variable with 
1—2 D.F., respectively, and do not depend on y. Now 


© 


fis — pi( — B совес y) g( — В совес y. у) 0( — B совес у)/бу + [ » p(z) [09 (2. y)/0y] dz, 
4 — B созесу 


and ĉg(z, у)/ду = palz tan y + Bsec y) Q(z tan у + B sec у)/ду 
= (авес? y + Bsec y tan y) py(z tan y + B sec y). 


But (- В соѕес y. y) = 0, and hence 


ЈУ) = p (zsec?y + B sec y tan у) p(z) pa(z tan y + B sec у) dz. (37) 
4 —Всозесу 
ПВ<0 the integrand is > 0 on the range of integration and then clearly Ј (у) > 0. If B>0 


We transform the integral (37) by the substitution w = zsecy+Btany to get 


Ју) = T p(w eos y — В sin y) py(wsin у + B cos y) dw. 
—Bcoty 


By utilizing the explicit forms of the densities p, and р, this may be written 


ro)= | NEC dw, (38) 


Where v(w) = Ри) we, 


Disa positive constant, and (39) 


u(w) = wsin y +B cosy 
із positive forz > — B cot y. If we break the range of integration in (38) up into the intervals 


(— Всођу 0), (0, B cot y), (B cot y, со), and drop the last, where v(w) > 0, we get 


res fo "[o( — 10) + ои) de. 


0 


(40) 


For 9 <> < B cot y, u( —w) is seen from (39) to be <u(w), hence | v( —7) | <v(w) since k> 3, 


an i : i is > 0 and therefore f’(y) > 0. | 
im er im Р,(оо) we revert to the geometric picture. To make у — co 


; imiting value ~ 
Жоқары Pini N с. For |y | > ду, we recall that Р.) is the pu p 
the sphere lies inside the cone-7. The cone Я may be pie ees by soit ыы А 
With vertex at the origin about a sphere 7 with centre at] 1 | on : e жа 4. 
As |y | оо this picture becomes indeterminate. However, we ped 4254 бет адга 
if We hold the sphere 2 and the joint distribution of 6), ..., G1. 67 fixe 
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f the circumscribed сопе 2 go to —c on the y,_,-axis. The limiting figure for the | 
of the y у i а= 
4 is now the cylinder circumscribed about 7 with elements parallel to the y, y-axis 
probability РАС) that the sphere 7 fall in this cylinder is the probability that 


[k- 2, Và 
(28) «ese«o,, 


1 


Division of this inequality by Cc yields the desired (29) and concludes the derivations. 


The writer is indebted to Mr Seiji Sugihara and Mr Judah Rosenblatt for making the 
numerical calculations. Tables of Merrington & Thompson (1943) were used in the 
caleulation of Tables 1, 2, 4 and 8, tables of Pearson & Hartley (1942) for Tables ! and 
6, tables of May (1952) for Table 2, tables of Merrington (1942) for "Table 4, tables g 
Hartley & Pearson (1950) for Table 5. tables of "Thompson (1941) and the W.P.A. tables 


(1942) for Table 7, and tables of Karl Pearson (1934) for Table 8. 
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THE ESTIMATION AND COMPARISON OF STRENGTHS OF 
ASSOCIATION IN CONTINGENCY TABLES 


By A. STUART 


Division of Research Techniques, London School of Economics 


1. INTRODUCTION 


In testing the significance of an observed association between two characteristics in a con- 
used. When, however, we wish to measure strength of associa- 


tingency table, y? is generally 
arson’s coefficient of contingency, 


tion, some text-books recommend the use of Karl Pe 
а function of X? which may be referred to the conventional range (— 1, +1), although it 
cannot attain the limits of this range. Kendall (1948а, chapter 13) gives references to the 
work on the coefficient, largely by Karl Pearson himself, and points out the difficulty of 
ariance on the hypothesis of independence. 

Yates (1948) proposed a test based on scores for association of characteristics of the kind 
Considered in this paper, but its distribution, as he points out, is only obtained on the hypo- 
, and it becomes progressively more inaccurate as the degree of 
association increases. Williams (1952) has recently considered various tests based on scores. 

In this paper, a coefficient is proposed which is independent of scoring systems. It is, in 
fact, a suitably modified form of Kendall’s rank correlation coefficient, which depends only 
on ordinal properties. It is shown how the existing theory of the coefficient may be used to 
estimate the population association, to set confidence limits for it, and also to test the 
difference in the coefficients caleulated for two contingency tables. 

Since the proposed coefficient depends on rank order. a condition for its applicability is 
that the rows and columns of the contingency table fall into a natural order. This is, in fact, 
Benerally the case. From one point of view t he proposed coefficient may therefore be 
regarded as availing itself of more of the information supplied by the contingency table than 
does the contingency coefficient, which is invariant under interchanges of rows or columns. 

The confidence interval and the significance test obtained by use ofthe proposed coefficient 
are conservative, in the sense that a lower bound is provided for the confidence coefficient, 
and an upper bound for the probability of wrongly rejecting the hypothesis tested. This is 
а consequence of the fact that only an upper bound is known for the sampling variance of the 


Coefficient, and this is in general a poor upper bound, although sometimes attainable. 
: er than no test at all, and this test should be useful in 


ample sizes are large. 


obtaining its sampling v 


thesis of independence 


However, a conservative test is bett 
many practical situations, especially where s 
2, CONTINGENCY TABLES AND RANKINGS 
at his rank correlation coefficient t may be calculated for 


а contingency table whose rows and columns are ordered by the criteria of =: 
An rx table with a grand total of n is regarded as two rankings of n objects accor pe T 
Characteristics for one of which only r separate ranks are distinguished, and for E eo 
only в separate ranks. Looked at in this way, the marginal totals are simply the ae po 
Objects tied at each level. There a one-one correspondence between any con ingency 


table and a pair of rankings. 


Kendall (1949) has suggested th 


is thus 
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The original context of this suggestion was the estim 
> 

correlation parameter in the case of non-normal у 


i ' associati ' contingency 
coefficient ¢ should not be used in its own right as a measure of association for conting 
tables in which there is a natural order for rows and columns. 


ation of the product-moment 
: ч б M 
ariation, but there is no reason why th 


3. THE DENOMINATOR OF | 
The test commonly used to test the significance 
no ties in the rankings, is described by Kendall ( 
table, it is made with reference to a ћу 


of an observed value of t, where there am 
19480). Like the X? test for a contingency 
pothetical population obtained by the SS 
of sample observations. In calculating the coefficient, we may therefore use Daniels's (1 ee. 
definition of £ in the universe of sample permutations. If Ti ту are the ranks allotted to t 
ith and jth objects (i <], and ар, б; are scores, one defined on each ranking, given by 


b.slt*l (трет, 
“Pt \ 1 (қт), 
Daba 1 
this definition is [e 95 (1) 


(Уа)! (хы) | 

The summations in (1) extend over all unequal suffixes. The property of the coefficient 
=—l<t< +1 (2) 

is seen from ( 1) to be a conse 


quence of the Cauchy inequ 
attainable only when the me 


ality, the equalities in (2) being 


ore 
mbers of each set of Scores are all +1 or all — 1, When the 
are no ties, (1) is equivalent to the more normal definition 
22098 (3) 
n(n—1)’ 


where S is a function of the number of i 


nv 
When we come to consider rankings с 


ersions formed by the two r 
on 


taining ties, we must furthe 
Q5, bij = (ғ; = 7), 

and alternative forms are now possible for 1, for 
the denominators of (1) and (3) 
association exists, the dise 


the test may ђе carried out directly on the num 
association, it ig of some 


conventional range (—1 


ankings. 
r define 


(а) If, when ther | А 
sent, wi 
(3), we obtain the coeffici сонин 
b) Tf, on th t 
es of de у b tis denoted by t,. The r 
compared to the untie is decreased in the дей 
equalities 1 2 В OWs from th : ali that 
Vm It = ж in not attainable by t, except in the ida. ee аг 
n 
и numerator but al : inator #88 
: жі, rtain] Д arger denomin e 
deficiency of the ii Wi m This is analogous to the sim? a 
: ere ber of ties is large, the deficieD?" 


m 
ue to use n(n — 1) as the denominator, 88! 
all (1948b) as ty. 


(1), the coefficien: 
he denominator 


а 
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will be serious. Since we are particularly interested in contingency tables, which often have 
large marginal totals, we must consider whether a more appropriate form of denominator 
exists. 

(с) Consider the maximum (positive or negative) numerator Xa;b;; which can be produced 
by a sample of x observations arranged in a хз table. This will be attained when all 
Observations lie in cells in a longest diagonal of the table, and the frequencies in these cells 
are equal, or as nearly equal as possible. Any move away from this position will decrease the 
total score Na пу. A longest diagonal contains m cells, where m is the lesser of 7 and s. 

Ifnisa multiple of m, the maximum sample score is 


= (венн) (е). (4) 


m 


(If n = m, we are back to the untied case, and (4) reduces to n(n — 1), as it should.) Thus 


(4) is an attainable upper bound for Xa;;b;;. 
When э is not a multiple of m, (4) remains an upper bound, but cannot be attained. This 


is a quite unimportant deficiency in practice, when % is often large and m very small, and the 
residue of n to modulus m necessarily small. 


Backes 28 
We therefore define & = ик > Т) mL)" s 
п т 
т т 
n—1 m 
whence Qc 6-1 (m—1) la: ы 


It follows that |, can sometimes attain, and for large n can generally almost attain, +1. 


4. THE SAMPLING PROPERTIES OF t, 


Remembering the correspondence between a pair of rankings and a contingency table, let 


Us consider the cell frequencies of an 7 х 8 table, 
fa бет les) 


аз having been sampled from a population r x 8 table with frequencies И. Theni 


1 
Жа-1) ХаубаЙ/л 


ta = 


1 
A Elta) = $5-1 Хаби E(fafg). 


Which by the ordinary formula for sampling from a finite multinomial population, becomes 


1 
n(n— 1) 
1 


= N(W-1) Хауба ћу = То» 


n(n—1) 
Хау үү 1) Кубу 


(7) 


as is t, for the sample. Thus t, is unbiased 


where 7, is defined for the population exactly 
for Tas 
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ХУ g 8 ss t 7 ева і ікігі і tends 
Now Hoeffding (194 ) has shown that for large №, the s mpling distribution of t а : 
Ке š 5 Е Я ева 
to the normal form аз n increases, and its sampling variance obeys the inequality, first giv 
о А | е 
Ђу Daniels & Kendall (1947) for the untied case, 


2 8 
Waly тај (8) 
If we now define, analogously to (6), 


(У—1) m = 
Чая М (m—1) “ 
and use (6), (7) and (8). we find 


El) = h-ny. 


n (М— ђе 


-— 2 (2-1) т 2 (n— 1) NO | 
Ану [ n (N-1)"| |, 


which for large N and n reduce to 


Kit.) = T., 
(te) = т, | (9) 
z m | | 
уат 5-1(-- =. 
т |\т—1 


Аз usual, we may estimate 7, in the sampling variance by |. 
The net effect of using t, is therefore to multiply the у 
by m[(m —1) approximately, and increase its sampling 
property of unbiased estimation of its popul 
(9) may be used to estimate 7 


"^ association 
alue of our measure of = 
2 : hile 1ts 
variance appropriately, w hile 
ation correspondent 7 


becomes asymptotic. 
о and to set conservative 


confidence limits for it, Similar y 
i [ 2 ап 
tive procedures are forced upon us by the fact that only г 
i i Ч 5 ав 
ent сап be obtained. Daniels (1950) В: 
Е; В Tw lover- 
‚ though sometimes attainable, is in general a poor one. - 
| TM eU «for 
; even these conservative limits are close enough together 


5. Tur CALCULATION Op 5 


Arrange the columns of the table so th 


80 that they are ordered in the same ве 
at the top left corner of the table, 


For each cell in the table, multiply its frequency: 
(i) Positively, by the frequencies of all cells lying below 
(ii) negatively, by the fr lls lying 


at they are ordered from 


left to right, and the rows 
use from top to bottom, T} 


; ‘origin’ is 
lat is to say, our ‘origin 


6. EXAMPLE 
Tables 1 and 2, based on case records 


eus of the еуе-і 
factories in 1943-6, have been Constructed res dio 
Association of Opti itioners me 
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| First, we calculate the £ t, association coefficient for each table. Using the method described 
in $5, we find that for men (Table 1), S = + 2,480,223, so that 


For women (Table 2), similarly, 5 = + 13,264,256, so that 


Table 1. 3242 men aged 30-39: unaided distance vision 


ке Left eye Highest Second Third Lowest Total 
| Right eye | grade grade grade grade 
| = | Мк Есе 4 
Highest grade 8 | 112 85 35 1053 
Second grade 116 494 145 27 782 
Third grade 72 151 583 87 893 
Lowest grade 43 | 34 106 331 514 
Total 1052 791 919 480 3242 | 


Table 2. 7477 women aged 30-39: unaided distance vision 


CES | к 
Left eye | Highest Second Third Lowest | Total 
Right eye grade grade grade grade 
Highest grade 1520 206 n = рае 
Second grade 234 15 E. iv EC "A 
Third grade 117 Baz ia in E | 
Lowest grade 36 82 16 f g | 
Total 1907 | 2222 2507 841 7477 


These values are very close together, and in the ordinary course of events we would not, 
Perhaps, bother to test their difference. For illustrative purposes, however, we may carry 


thr rough the calculations. 


Our estimate of the upper bound to the sampling variance of /, for men is, by (9), 


5545 2 {(#)— (0-629)? = 0-000853, 


and for women, similarly, it is 


2 _ 0(8)2 — (0-633)2) = 0-000368. 
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i i im: j appropriate 
The values of the coefficient, and their difference, with the estimates of the appr ions | 
i i imits hase бө arc 

maximum standard errors, and the conservative confidence limits based on two stand 


errors, are therefore as follows: 


| Maximum 2 S.E. conservative 
| t, standard error confidence limits 
| Men + 0:629 0-029 0-571 to 0-687 

| Women +0-633 0-019 0-595 to 0-671 

| Difference 0-004 0-035 = 


Tt follows, as we antici 
shown to be significant 
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A SEQUENTIAL TEST FOR RANDOMNESS 


Bx P. G. MOORE 
University College, London 


l. The problem frequently arises of deciding whether a sequence of observations, each 
Observation falling into one of two alternative categories or types, occurs in a random order. 
David (1947) has considered the 'group' test in some detail. The hypothesis, Но, that we 
àre considering here states that there is randomness within the sequence against an 
alternative, H,, that there is dependence of the kind found in a simple Markoff chain. 
David's test is based on the number of groups of a common type which the observations 
form when placed in order in the sequence. 

In this paper we are going to deal with another form of this procedure which leads to 
à sequential test. This has obvious advantages in that the amount of data which must be 
collected in order to obtain a decision is not fixed beforehand. It may be that the first few 
values will suffice to make a decision one way or the other and only in cases which lie in 
between will a long sequence of observations be necessary. This form of procedure may well 
mean a large saving of time, money and materials. 

в we write 1 for the happening of a certain event and 0 
h event, which may either be a 1 ora 0. The theoretical 
rkoff chain. The general principles of such a chain 
1938) and the notation that we will use is that 


2. In the sequence of alternative: 
forits negation. Let E, represent the st 
model envisaged is that of a simple Ma 
have been discussed at some length by Fréchet ( 

Pr(E, = 1) =P, Pr{B, = 0) -0. 

Pr(E = 1| ва = Пер, Pr = 0 | В. = 1} = go 

Pr{H, = 1 | Ба = 0 = Р Pr{E, = 0| Е, а = 0} = qd» 
Where P+Q=1, т+ћ=], + = 1. 
The hypothesis of randomness is a special case 
of the two possible results of a trial are the same 
= р» = р (say) and hence, of 


These equations define the general set-up. 
of this Markoff chain when the probabilities 
whatever the result of the previous trial. This means that р, 
Course, 4) = др. 

It is necessary to make some 
assume that the sequence’s start 
also obeys the same probability 

Pr(E, = 1) = Pri 
Where Æ, is the event chosen to start the sequence and Ж, 


assumption concerning the start of the sequence. If we 


is at a randomly selected point in a longer sequence which 


model as has been given, then 
= 1, = 1}+Рг{Ё, = 1, E, = 0), 
is the previous event. Hence 


Р = рР +220, 
= Pü-p)-»Q giving Р: = Рад. 
We will denote the hypothesis of randomness (specifying a particular value of p) by H 


1 iven earlier as H,. Then it is known 
апа a Markoff chain set-up 45 а нити ree = Peo if there е an even number 
(David, 1947, p. 335) that, under the assumptions nr , 
Of groups (24), then Pr {t| ry re Ho} = E070 Der (1) 


(nag, 26 (82 f (2) 
9202 7g, Hj) = Kk "E Oper 710145) ' 
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У n i i £. " are constants such 
d r are the numbers of 1’s and 0's respectively and А and k’ are const 
er, and ra 2 | Қалеке 
mes Pr {28} Е 1, where X denotes summation over all possible values of / 
that {20 = 1, à 
For the case of an odd number of groups the 


e с icate "there 
xpressions are more complicated. If tl 
are 2t+ 1 groups, then 


Pr {t | пут, Hp} = (30,730, а) 


Й 1 P Q — 
ы [Poh AAC „+ Érni ry “| ( 
Pr ran Hyg = ===] | —п “> 4-1 П 
Pr(t| путу Њу Џ (=) Е 1 1-1 d; | 
iliti sible values of Lis 
Гала Г being constants such that the sum of the probabilities over all possible v 2 3 
unity. Moore (1949) has shown that for (large the distributions of Pr {i} under eithe а 
Н, approximate to one another. That is to say, that whether there be an odd or even = oses 
of groups, the distribution of Pr{t| H} is approximately the same, Hence for the purp » 
of this paper attention will be focused on the case of an even number of groups, sues 
mind that the results, at any rate for a large number of groups, apply equally wel 
case of an odd number of groups. €— 
It is necessary to find the values of k and ЈУ for use in equations (1) and (2). This is n 


ТЕГЕ ET „| and 
easily done by considering the case, { = 1, and finding the exact probability under Hp 
Н, by simple enumeration. We find that 


(3) 
k = 2рър, 

Р 0 (4) 
b^ = РЕД > 
is inc Е "m ` 


3. То test a hypothesis Ay against a hypothesis Њу, with 
it is true апар of rejecting 


. "реп 
a risk а of rejecting Hy when 
Н, when it is true, the 


лајв of 
Sequential test procedure consists 
calculating at each stage of the Sampling the likelihood ratio L. 
If 
is =e (5) 
x 
We accept H,, whereas if L« É " (6) 
—а 
we accept Ay, andif L lies in between the values in ( 5)and (6) we Continue to take observations: 
The likelihood ratio in this case is 


La PEM | пут ny 
Pr (t Ins; т», Hy} 


D 
|+ | 

= —MP qj in (n ў 
рта “ea | 

У 

and valid for alues of the variables. We өс” 
here under Нур = 1= 0.5. Under А, we will t t 
cular value o ; 


si 
ther than 0-5, Tt will be noticed th 


" ow 
sider е! 


t eration of runs above and b е 
18 Important that the value used as median shoul M 

ees 1 

icking out a departure from random! 


о 
Proportion of ру and 0%. Fr 


(7) 
This is a perfect] 


propose to consider 
Р =Q = 05 and p, 
this simplification h 
the median value in 


all у 
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§2 we have that Ра = ду because Р = Q and hence р, = 4; since р, +q, = 1. The expression 
for L may now be re-written under these simplifying assumptions as 


1 1 
к 2га 1 ЧА 


ЖҮН R-292L-1 
= Rapp 


(2p,)F-*(2 — 2р,)%-1, (8) 


Where R = т та. 
Thus L, in this partieular case, is seen to be independent of the actual values of 7, and 


"2 and only dependent on the total length of the sequence. 

_ 4. Under the sequential system observations are taken for so long as L satisfies the 
Inequality BI(1—a)«L« (1— f)Ja. 

Expressing this inequality in terms of logarithms and substituting for L the expression 
Obtained in (8), we get that observations are to be taken for so long as 


Yi 
log E < (R — 2t) log 2p, + (2t— 1) log (2 — 2p) < gf, 
—а 
which in turn may be written as 
1-5 1—8 2-2 | 
юв E (2-2p,) < Blog 2p, + 2tlog -5 < log | а à-25)j. (9) 


The test procedure may now be carried out graphically. The horizontal axis of the graph 
Corresponds to R and the vertical axis to t. Then if we draw the relationship given in (9), 
replacing the inequality signs by equality signs in turn, we obtain two parallel straight lines. 
Then, provided the point (2, /) falls inside these two lines, a further observation is to be 
taken. ТЕ the point falls below the lower line then we accept the alternative hypothesis, 
Th, if p, is greater than 0-5 or the null hypothesis, Hy, if p, is less than 0-5. Similarly if the 
Point falls above the upper line we accept H, if p, is greater than 0-5 and Н, if p, is less 
than 0-5, 

5. The whole procedure is best illustrated by means of an example. The data are taken 
rom Brunt (1925) and consist of the annual rainfall in inches at London for each of the 
Years 1813-1912. To carry out the test we have described it is necessary to decide how to 
Make the division of the rainfalls into two groups. After 16 years the median rainfall, that 
is, the average of the eighth and ninth rainfalls when the 16 are ranked in order, is 23-87in. 
Tn Table 1 each year is given the symbol 1 if the rainfall is above this median value and the 
Symbol 0 if it is below the median value. Only the first 45 years are given in Table 1 for 


®conomy of space. 


In this example we will test the hypothesis of randomness (р = q = 0:5) against the 


alternative that p, < 0-5, taking as the particular alternative required to define the pro- 
Cedure = 0-3 has "5 are testing whether wet years tend to be followed by dry years 
Or not i ы 2 d # will be taken as 0-05, and hence our two equations for the critical lines 


E Rlog (0-6) + 2tlog (5) = log (45) (1-4), 
Rlog (0:6) + 2tlog (5) = log (19) (1-4). 
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Evaluating these equations numerically we obtain the two parallel lines 
R-—3-3174t— 5-1054 = 0, (10) 
R-3-3174t + 6-4228 = 0. (1) 
The lines given by (10) and (11) have been plotted in Fig. 1 and the figure completed by 
putting in the observed points (R,t) for R greater than 16. Note that the first sixteen 


Table 1. Rainfall at London in inches 


Year Rainfall Year Rainfall Year Rainfall 
1813 23-56 0 1828 2788 1 25:85 
1814 26-07 1 1829 95:39 1 Ба 22-65 
1815 21.86 0 1830 25-08 1 1845 22-75 
1816 31:24 1 1831 2716 1 1846 26-36 1 
1817 23.65 0 1832 19:82 0 1847 17.70 0 
| 1818 23:88 1 1833 24.78 1 1848 29:81 1 
819 26-41 1 1834 20-12 0 1849 22:93 0 
| ud 22.67 0 1835 2434 1 1850 19-22 0 
| ed 31:69 1 1836 2742 1 1851 20-63 0 
| TE 23-86 0 1837 19-44 0 1852 35:34 1 
| 18 2411 1 1838 21:63 0 1853 25:89 1 
| i 3243 1 1839 27-49 1 1854 18:65 0 
ise mod ' 1840 19:43 0 1855 23:06 0 
pe pe 1841 31-13 1 1856 22:21 0 
0 1842 23-09 0 1857 22:18 0 
аллы лға т, АН aaa 
12 
Accept H, 
Х---ж---ж---і 
10- ! 
2 Гете 
5 ee us .| Continue 4 
SL | sampling 
> Х---х- --х- -X-- 2 e" 


f 


Џ 
6%-—-х---х---х---х__) 


26 28 
Value of R 30 


32 34 36 


е 
ups of Ty, so ай В = 16, 2% = 12- үү 
Ris equal to 36 when / = 1 BR 6). We see that the point rena” 
= Запа it goes outside the upper line. Thus А 

т 

бері the hypothesis that wet Y°” 
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We have already noted that since Fh states р = q = 0-5, it is essential that аз the sequence 
progresses a check is made that we have about equal numbers of 175 and 0’s. This is to 
prevent the discriminating power of the test being blunted by a wrong choice of the median. 
Іп this particular case we find that after thirty-two observations, for instance, there are 
about equal numbers, in fact seventeen 178 and fifteen 0’s, so that there is no reason to 
question the choice of critical value taken for the median. Had there been a very wide 
discrepancy it would have been best to have obtained a new median value and recalculated 
the value of t. The critical lines would remain, of course, the same as before. 

In the procedure just carried out we tested the hypothesis of randomness against the 
hypothesis that ру = 0:3, meaning that wet years tend to follow dry years or that a negative 
Correlation exists between successive years. A more general problem would be to test the 
hypothesis of randomness against a Markoff chain that was either positive or negative in its 
Correlation between successive events. This means that there would be three alternatives, 
(а) positive correlation, (b) randomness and (с) negative correlation between successive 
events. In this case the problem could be tackled by setting up two simple sequential 
Schemes, The first would be the same аз that already described, and the second scheme would 
be set up with p, = 0-7 (say). If at any stage in the sampling the first scheme told us to 
accept the hypothesis that p, = 0-3 we would do so, and similarly if the second scheme told 
US to accept p, = 0-7 we would do so. However, we would only accept the hypothesis of 
randomness if both schemes told us to do so. If none of these three results has occurred we 
Continue sampling. Both sets of sequential limits may be drawn on the one diagram, which 
makes the whole procedure relatively simple to carry out. An example of having two 
Sequential schemes superimposed on the one figure has been given by Armitage (1947). 
He was concerned with testing whether the mean of a sampled population was equal to 
Some specified value against the alternative that it was either larger or smaller than the 


Specified value. 


6. It is useful to have a knowledge of the expected sample size associated with any 


Sequential test before starting the test procedure. Unfortunately, as we are discussing a case 
dependent of one another the usual formulae for estimating 


able. It is, however, true to say that the sequential 
ations in many cases, since frequently decisions 
fixed sample size of the usual methods of 


where the observations are not in 
the average sample size are not applic 
Procedure will lead to a saving of observ 
are made on fewer observations than with the 


testing, 
Tos pie that has been derived and illustrated here has been concerned with the case of 


P= = 0-5, Tt is clear from the derivation in $3 that the procedure could be used for 
P unequal to q by suitable adjustments. It would also be possible to base the zx а 
Markoff chain principle under which the observation depends not just on the imme iately 
Preceding observation but on the two preceding observations. This case will, it is hoped, be 


vestigated in а later рарег. 
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ON THE MEAN SUCCESSIVE DIFFERENCE AND ITS RATIO 
TO THE ROOT MEAN SQUARE 


Bv A. R. KAMAT 
University College, London 


1. INTRODUCTION 


Let z, (i = 1,2, -..,) denote а sequence of 7 normal у 


: = n 
ariates with means д; and а commo 
variance c?. If all means и; 


S р кыне À 2 is of 
are identical, i.e. if д, = д, then the best estimator of g? is 0 
z (x;— т)? В А CS . ‹ ving’ 
course 8 = У 7$ 75 Situations, however, arise in which И; тау have a ‘slow-moving 
іші 
continuous trend, and for such situations two alternative estim: 


ators of с have been con- 
sidered. They are: the mean-square successive difference 


8% = "у itea)? 


i=l n=l ” 
and the mean successive difference 
si 
ag.: |ti— ti | 
а= у ul 
11  n—1 


Von Neumann, Kent, Bellinson & Hart (1941) have given a brief historical review of these 
statistics and their use in ballisties and in astronom: 


statistic 8°, As to results about d, Arley & Hald (1 


its efficiency, * 


obtained by Nabeya (1951, 1952), and also derive: 
first four moments of d can now be evaluated. T 
the approximate distributions of the mean succe 


mean square, viz. d/s, based on their first four moments, 


2. Tue DISTRIBUTION OF THE MEAN SUCCESSIVE DIFFERENCE 


2-1. Preliminary results | 
/ 80 that 2,%,.. 
andard deviation 


Let us assume that И; = 


"T san with 
constant mean y апа st; „Фа 15 а sample from a nor mal population 


: с. We shall use the following notation 
(e Tes 47 A = [a | (--1,9,., n—1) 
D т-1 n—1 | 
= PE = РУ |а; “Til. ) 
Then the mean successive difference is | 
da P o €t d; "S | —, 
= e а | 
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It is clear that д; are normally distributed with zero mean and that 


о(д;) = 4/20, Plòi ip) =} (р=1) | 
= 0 (p>)).| 


To find the first four moments of D (and therefore of d), it is necessary to evaluate the 
following expectations: 


é(d,), 
Eldi), Eldido), (3) 
Eldè), Eldido), «(4,4,4;), 
Ed), (84), ©), 60445), 68,44), (944341). 
These are either the ordinary moments or the absolute moments of the multivariate normal 


distribution of 61,0», ду ду. All of them except the last, viz. %(4,4,4,44),сал be evaluated from 
the results given by Nabeya (1951, 1952) and Kamat (1953а) for the bivariate and the 


trivariate absolute moments, and are as follows: 
2 

ут” 

Eld?) = 20°, (dy dy) = (28+) а“, 


Ela) = 


4 bm 1 
6(83) = Jos, &(did,) = 0“ E (d, фуву) = вон) 03, (4) 


5 
ул , 

3 2123), Ed) = 60t 
«(а8) = 1204, даға) = (^^ , «(азаз) = 60“, 


с 12 


. As regards 4(4,4,4,44), two methods were used to find its value. The first is to use the 
Integral form given by Nabeya* (1952). The actual evaluation from his formula of this 
absolute moment, for the four-variate case with a general correlation pattern, involves 
Considerable difficulty. Even in our particular case, where three of the six correlations are 
ero and the remaining three are equal, the evaluation is quite elaborate. Using this method, 


404 f e У УМА |а, Б 
г@„ааад - 5 [|| лака ее PIi- р) ee et (5) 


= Pog = Pas = — b Pis = Ом = Pra = 0. Carrying out the 
n be reduced to a sum of nineteen integrals of the type 


Where i,j = 1,2,3,4; +] and fis 
differentiations in (5), the expression ca 


[ | | | dapi tpexp[ - 358 ру ада, (6) 


* We are indebted to Mr S. Nabeya for sending us the manuscript of his paper (1952) which contains 
А ebte . у 
this formula in advance of publication. 
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1 p<+2andl+m+n+p = 0, —2, – 4. Eighteen of these integrals can 
where —1«l,m,n,p« | 
be evaluated analytically. For instance, 


| ее ехр[– 22:5 — Ур,,1,1]4,9,44, 


-% со | 
e 2 wv , 
= ІС [55 = [= 2008 Урада СД | 
0 —o 


=], ра | 


where 


by = 63 =, = 1, pig = Poy = pu = 0. 


This is further simplified as 
|| e 2 2 2 
ШЕСІ [55 (4 + к. where А = (b,—p},) (1 = Pia) — Рз 
| -i 
= бл? [а — Pia) (3 — Pia — Pie Pis pls)! ры ар 
0 


1 
— (845 +83 tan T) Қ 


415 
т =n = p = — 1, however, leads to 


fe (eu 
Ju „== Ма –аз) 
which had to be evaluated b 


у numerical quadrature, and its value was found to be 0:328988- 
Substituting in (5) the values of all these nineteen integrals, it reduces to 


The integral (6) with 1 = 


1604 а 3. т 

é (dy 4,4. а.) = m bs amu bina H 
= 160*(1-366622)/5? = (2-215484) ot, (8) 
correct to at least five 
This result was che 
of correlation coefficie 


places of decimals and, 
cked by finding E (d, d, 
nts given by the p 


possibly, to six places. 
а,а,) using the ex 


— 5 ers 
pansion in series of pow 
resent author (19534, 


Pp. 28-30), viz. 
1604, 
4(4,4,4,4,) = xti + рур ру + 245, — 2p}, p%, 


9) 
ЖАУ + ХрурыРаРа +.) ( 


“(а,4,4,4,) = 1604(1-366574)/ле — (2-215406) g4 m 
Which coincides with the result above to first four signi 


ficant figures, 
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ни бин O 2:2. Moments of 4 
E(D) = mé (dy), 

E(D?) = mé (d3) + 2(m — 1) &(d, ds) + (m — 1) (m— 2) &%(4,), 

E(D?) = mé (43) + 6(m — 1) (084) + 3(m — 1) (m — 2) 6 (d3) E (d1) + 6(m — 2) &(d, ds аз) 
+ 6(m — 2) (m — 3) (d, ds) (dı) + (m 2) (m— 3) (m — 4) &%(а,), 

&(D*) = mé (dà) + (m — 1) (080) + 4(m — 1) (т 2) & (di) E(d,) + 6(m — 1) 6 (d1d3) 
4-3(m — 1) (m — 2) &*(d3) + 24(m — 2) 6 (dd,d;) + 12(m — 2) £ (d, 0343) 
+ 24(m — 2) (m — 3) 6 (аза) 6 (di) + 12(m — 2) (m — 3) 6 (d, ds) 6 (43) 
+ 6(m — 2) (m — 3) (m — 4) £ (di) &*(d1) + 24(m — 3) 6 (dı d, d5d4) 
+ 24(m — 3) (m — 4) E (d, ds d;) &(4,) + 12(m — 3) (m — 4) 6*(d, а.) 
+ 12(m — 3) (m — 4) (m — 5) 6 (d; dy) &*(di) 
+ (m — 3) (m — 4) (m 5) (m 6) &4(d,). 


(11) 


Substituting the values of the various terms, the results (11) lead us to the following central 


moments of d: 


la = Žo = 1.128379 
ут = 2 б, 


8 а. 15 = 
8 4/3 12 1. 2 443 Че 
8 п m \3 п m? 


mi 
= (1-052264m-1 — 0-325504т-3) 0", 
Аз = (non^ 160+ 24 үз— 96/3 720057115} zr 
> (n 192 +48 /2— 14443— місе) т. 


= (1-642670m-2— 0-87059873) 03, 


ва = неее | У 10007-0024 2104-09} 
3 7T m? 43 


(12) 


167? 5 
—8n(20,/3-- 74) - ^5 247 а | т?т? 


1 
+ [т6(14 а +468 —432 сова - 38049) +87(21 4/3 + 256) 


+ 4g? — 72а? =} g^, where «= <(4,4,4,4,)/6% = 2:215484 


= {3(1-052264)? т7 + 1:119816т-5- 2:040188m-*) 01. 
2.3. Approximate distribution of d 

nd //,— 3 as m (and hence n) tends to infinity. Table 1 
‚ = 3(1) 10, 15, 20, 25, 30, 40, 50. The „у, В, points lie 
Type Ш line. If we may assume that a Pearson 
ments will represent the distribution of d ade- 


t is clear from (12) that > 08 
p the values of o(d), 1 and Въ for 7 
„ће Pearson Type I region, just above the 

Уре I curve with the correct first four mo 
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quately, it is relatively easy to obtain approximations to certain percentage points. The 
upper and lower 5 and 0-5 % points for the appropriate Ди, J, values were found by inter- 
polating in Pearson & Merrington’s (1951) tables of standardized percentage points. For 
n = 4(1)8, the 2-5 and 1% points were obtained by interpolation in Thompson’s (1941 а) 
tables of percentage points of the Incomplete Beta-fünction, first finding the appropriate 
p = Капа 4 = фу, from the standard relations connecting the Type I parameters with the 
Ву f constants of the distribution. Finally, for 2» 8 it was found that a Type ІП curve 
with the correct first three moments was indistinguishable for our purpose from the Type 1, 


and the 2-5 and 1% points were therefore found with the help of the tables of percentage 
points of the y?-distribution (Thompson, 19416). 


n 
Table 1. Standard deviation, B, and фр, values for djo = У; [а-а |[{(®— 1) е} 
іт 


т S.D. А, Bo n S.D. f Bs 
| с E E HE 

3 0-6669 1:0356 4-2962 15 0-2711 | 0-1638 3-2028 
4 -5609 0-7253 3-8940 20 -2334 +1210 3:1499 
5 4927 “5547 3-6857 25 -2080 0959 3.1188 
6 4443 -4484 8-5548 30 -1895 0795 3-0984 
7 0-4078 0:3760 3-4655 40 0-1636 0-0592 3:0733 
8 -3791 +3237 3-4008 50 -1461 0471 3-0584 
9 +3556 -2841 3-3518 

10 -3360 -2531 3-3135 

== 


Note. Mean d/o 


жана ee = 1-128379. The fourth significant figure in f, and the fifth significant figure in Ёз 


Table 2. Percentage points for the approximate distribution of dl 


Lower Upper 
Е ___ 
n 0-5 ý 
1 2-5 5 5 9.5 1 0.5 
Er NET 
ВЖ 0-09 0-12 . a 
4 17 12 ЗЫ 0-27 2-41 2-75 3-16 3-48 
5 23 -28 37 2.18 2.44 9.76 2-99 
6 28 33 86 D 2-03 2-96 9.53 2.72 
ER 49 1-94 2:14 2.37 2:58 
7 0:33 0:37 у 
8 37 41 t 1-86 2-04 2-96 240 
9 40 44 53 "B7 1-81 1-97 2.17 2.30 
10 +43 47 BB “60 1-76 1-91 2.09 2-21 
i. ҚҰ 63 1-73 1-86 2-03 2:14 
р 0-58 0-6. 5 
20 61 ce al 0-72 1-60 171 1-84 1.93 
25 *66 -69 15 a 1-53 1-69 1.73 1:80 
a0 69 73 78 83 ү 1:57 1:66 11% 
; 1-59 1:61 1 
40 0:75 
50 is 0-78 0-83 0-87 1. 
81 -86 41 1-47 js 1:59 
У) 1-38 1-43 eem 1-53 
For n= 
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For п = 3 all percentage points were calculated from the exact distribution given in 
$ 2-4 below. АП these results are combined in Table 2. Certain tests of the accuracy of 
the approximations are given in the following sections. 


9-4. Comparison in the special case т = 3 

No direct comparison between exact results and the Pearson Type approximation is 
possible except in the rather exceptional case ® = 3. Here, by linear transformation, the 
probability integral can be expressed in terms of the bivariate normal integral and can be 
shown to be (see Kamat, 19534, р. 31). 

Рио <d} = 1420р. 40024,9) — рау, 0) + p 43) до 0) —P yal (5) во 0), (13) 

h о ро 1 
where р, (h,k) = (27) (1 -e | | ехр - (eg озу | ахау. 
adk 2(1— p?) 


(1931) Tables for Statisticians and Biometricians, 


This function is tabulated in Karl Pearson's 
age points for 


Part 11, The following comparison for the upper and lower percent; 
D = (n—1)d - 2d 


Shows that the Pearson Type I fit is adequate for practical purposes*. 


Lower Upper 
1 9.5 5 5 2.5 1 
Integr: 0-24 0-38 0-53 4:82 5:49 6:31 
тек зет cw 0-26 0-40 0:55 481 5-47 6-29 


9.5. Sampling experiment 

An extensive sampling experiment was carried out with the help of Hollerith punched- 
card equipment. 25,000 Hollerith cards, each bearing а random normal deviate from 
Wold's (1948) tables of random normal deviates, were available. Thess cards were sorted 
Into 2500 random samples of 10, Ху (= 12 10, = 1,2, v 2800). On a Hollerith 
Senior Rolling Tabulator the following 6 then carried out. From each of the 


500 samples of 10 the ten progressive sums 


|Ха—Ха|+| Ха- Хеј "s 


abulation was 


10 
Х,-Х, 
|Ха-Ха |, Pt ji zal 
i ; f the next). 
Were formed and printed (taking Кулу of the first sample as X4 19 he next) | 
_1<9, the particular sum 2 | Xj — Жива | provides 


v 
о that for each of the sample sizes 2,3, ..., 10 (i.e. 
cessive difference sum, D, were 


values of the suc ; 
o provide 1250 independent sampling values of D for 


For any sample size 1 € 10, ie. ? 


an independent sample value of D, 8 
des = 1,2,...,9), 2500 independent 
available, The same tabulation can als 


в table was sent to Press show equally satisfactory agreement at the 


TM Caleulations made since thi: 
er and upper 0:5 % points. 
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16 may be mentioned here that formulae (16)-(19) are exact and that (18) and (19) are 
obtainable from the first two moments of the non-central x "m 
Now, for a ‘slow-moving’ shift in the mean, X(A0;f|o? will be considerably less i — 
7(0;—60) Јо“, and therefore the formulae show that the bias in estimating т is —— 7 
greater using 82 than using ô? or d. Again, the increase in the variance of the estimate 


j ЕГ 5 а Рака e bias in 
also larger in the case of s?. Two examples of how a slow-moving trend affects the bia 
the case of the three statistics are given below. 


Example (1). 0, =и+(0-05)50 (2 =1,2,....п = 15). 


The percentage bias in the estimate d = 0-06. 
The percentage bias in the estimate à? = 0-13. 
The percentage bias in the estimate s? — 5-00. 


The bias introduced in estimating c by s? is 40 times larger than that introduced by either 
d or 9?.* While it is negligible in the latter it may not be so in the former. 


Example (2). 0, = H+ (0-04 + 0-00422) (i = 1,2, 9% = 10). 


The percentage bias in the estimate d = 0-24. 
The percentage bias in the estimate д = 0-48. 
The percentage bias in the estimate s? = 6-54, 


Even in this quadratic trend, the bias is 14 times larger using 57, 


3. THE DISTRIBUTION oF THE RATIO OF THE MEAN SUCCESSIVE 
DIFFERENCE то THE ROOT MEAN SQUARE 


3-1. Moments 
Von Neumann (1941) has discussed the exact distribution of the ratio of the mean-squa’e 
successive difference to the variance 


» Viz. 92152, where 


шт) — dd) (21) 
(в)? 

* Tt is to be noted 

9? estimates 02, 
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where ju’, is the moment about zero. [See in this connexion Geary (1936) and von Neumann 


1 
2 11 Substituting in (21) the moments about zero of d and s, the first four moments 
about zero are абара as 


2 
| iu) o eee) | 
| № = үл (5 
| 2 т--1 
| г 2 ) o Cs Пен ші) 
metis в 443-1) 1 (2 _ 1 
т a+ (+ п *) КЕ 5-5 


_ 1 
=~ (1273240 + 1-052264m — 0-325504m-?), 


«c 
1 ы 2 8 1 
= ELE Prec 12) —— 
J rez т тїт 
1 


Б] 
n+ i) 2 
+ | 12п + 208 + 2442— 120 43— 72 cos"! - а sS 
x: s A3/ тїт? 
1 
+ ШЕ 144 өз 192- ав ја — 421) шы 
NL _ 
re "ij 


1 
+ (96/3 — 288 + 647) zx 


22 
(1:43670 + 3-56206m- + 0-54079m-? — 0-870677), p uM 


ETE] 
Hos ERE tty де, 
| 


т(т--9 
1 
qm? 


вал? 4 G4 3m — 807+ 192 2 4-2 2048.— 1152 3 — 516 cos- J) s 


1 
+ (om ttm — 160/37 — 9287 + 4032 cos J 


1 
+ 4512 4/3 — 6480 — 134442) тата 
+ (in тодшта + 1684/37 + 20481 + 2304/2 


1 
+ 7488 — 5760 3 — 6912 057" 5) =| 
s feet 
m(m + 2) 
Where 


{1-62114 +8-088700 + g.24931m-2— 2-80941т- + 2-0414 07), 
"— жай Фе <(4,4,4,4) = 2.215484. 


3.2. Approximate distribution of W = djs 


From the moments about Zero given above, дл, o(W); f; and f, were caleulated for 
15 = 5,10, 15, 20, 25, 30, 40, 50. They are given in Table 4. (It is to be noted that из <9, 


-thero is negative skewness. ) It is seen from this table that the ра, Ёз points lie in the 
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Pearson Type I region, very close to the symmetrical Туре П line. For the онин. 
С = 1—(n—1) 6?/(2ns?), which is linearly related with 62/s?, the d herr 
fitted extremely well, as can be seen from Young (1941) and Hart (1942). T ida ina 
author has verified that the 5% points for 62/s? itself based on the Type п аршка е 
coincide with those given by Hart (1942) based оп the exact distribution within pns ( = 
sometimes three) places of decimals for n > 10. It seems probable therefore thar a F a > 
type curve will give a good fit to the distribution of W = (|в also, for n > 10. Table 5 gi 


Table 4. The mean, standard deviation, Ay, f» values for W = ај“ 


n ГА o(W) | By | Ва 
5 134231 — | — 03001 | 0-009 2-55 
10 1-2229 | 2116 013 2-84 
15 1-1890 -1709 011 | 2-91 
| 20 1:1730 :1470 :009 | 2-94 
| 25 1:1637 0:1311 0-007 2-95 
30 1-1576 “1194 -006 2-96 
40 1-1501 | -1030 +005 2-96 
50 1:1457 | 0919 004 2-98 

| 


Note. иҙ<0. 


Тађје 5. Percentage points for W = а ја" ЕР 
Lower | Џррег 
n - — — ---- 
05 1 | 25 5 5 25 1 Be 
З | — =| 
10 0-67 O71 | 0-79 | 4 1 а 1:73 
15 -74 лв | 85 y de s 1:10 1:60 
20 79 83 | 88 93 | 141 | 1-46 1. | 1:54 
25 . | Б | 
30 =. өе (521 0:95 1:38 142 | 147 p 
88 92 96 1:35 L39 | 4 
40 88 91 95 98 13 29 Ln 1-41 
50 91 93 97 95 32 | 1.35 1-39 1.38 
| | 130 | 1.33 1-36 
| zy 
* Itshould be noted that fo], 2. (вте У 
, НЕ" 9 owing thedefinition used by von Neumann we have taken 82 = 2 <= 
Le. taken a divisor of n, not n=l. = 
the upper and lower 0. д а 1 
Do the rea ней. p ; 5and5% points for W based on the Pearson type approx". 
Tho 24 and 19/ ii. ints were calculated from the Pearson Merrington (1951) tab ИЛ 
* о Or % = 10,1: & о 
(1941 а) tables of the ое О ig calculated with the help of ThomP® the 
Ри à : 5 tor the T А ile 10 
remaining sample ; Е Псотріеіе Beta. i while e 
E 5 ple sizes (n> 25) the normal approxi р i a-function, sat HOF th 
р Pose, was used, Tt may be Temarked that f БОП, Which was sufficien x 
significance point. 5210г po 


8 are necessary. Tt is h 


of the distribution of djo and dis in a la; 


5p 0 
Sitive serial i the © ы 
oped to giv correlation, only 18 


ü 
€ some fi i ions of the 
ter ра, x urther illustrations о 
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3-3. Comparison of the distributions of d|a and d/s 
It is of interest to compare the distribution of the ratio criterion d/s with that of the 
Sleep 4|6. This comparison does not reveal the familiar properties of a studentized 
statistic, as the denominator s, which has been computed from the same sample as the 
numerator, is not independent of d. In fact, d and s are positively correlated since 


ави - 5669 
ea.s) =6 (4s) = # (2) 09 = 2409, 
So that соу (4,5) = PAG —é*s)) = аг (в) 20. 


Та consequence of this positive correlation the ratio d/s has a smaller variance than d/o, 
as can be seen from Tables 1 and 4, and, since &(d/s) > 6 (4 ја), the same holds for their 


"relative variances’, i.e. 
var (d/s)/6?(d/s) < var (d]o)]&*(d]a). 
ә 
Аз ®-> со, &(4[в) > 6 (а ја) = je but 
var(d|s) _ 47--643—21 _ 0.395 
var (d/o) * dn + 643— 18 ч 
Which is less than 1. It may be added that similar properties hold for the statistics 97/02 


and 42/52 mentioned іп § 3:1. 
As will be seen from Tables 2 and 5 
Points of d/o bracket the corresponding perc 


‚ a pair of corresponding upper and lower percentage 
entage points of d/s. 


Finally, I wish to express my sincere thanks to Prof. E. S. Pearson and Dr H. O. Hartley 
for their help and guidance in the course of these investigations. 
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THE EFFECT OF UNEQUAL GROUP VARIANCES ON THE F-TEST 
FOR THE HOMOGENEITY OF GROUP MEANS 


By G. HORSNELL 


1. INTRODUCTION 


David & Johnson (1951) have studied briefly, inter alia, the effect of unequal group variances 
on the nominal significance levels of the F-test (variance ratio) when used to investigate 
differences among a number of group means. The present note takes those calculations 
a step further and also considers the effect on the power of the test in a special case. | 
Ifa total of N observations of a random variable are divided into I groups, with frequencies 
m(t = 1, 2,...,1), where N = Ут, if S, and S, are the usual between group and within group 
t 


sums of squares and if F, is the 1000: % significance level of the F-distribution with и 71-1 


and v, = N —1 degrees of freedom, then the David-Johnson method consists іп approxt- 
mating to the distribution of 


«-&- ns. (1) 
Three types of curve have been used in deriving the approximations: 
(i) the Edgeworth series in the form 
Ш x K(X) S СЕ ~ , LOKE(X а 2 
P = уе дж ga, 109 ТЫ) @ 


where X = (x — кү(ж))/к(х), к (2) 
polynomials; 
(ii) Johnson’s (1949) unbounded curve Sy, and the log-normal curve ч - 
(iii) a Pearson curve, usually of type IV. 
The curves were fitted usin 


is the rth cumulant of z and the H-functions are Hermite 


Set | 
Ы К Kog Koy 

$ i 1 1 " 

с | 1 2 

D 1 | 2 5 2 + 
| Е | i | i 1 | 1 
| В | І | 1 | 2 3 
| | 3 3 


| 


С. HORSNELL 129 


2. EFFECT ON SIGNIFICANCE LEVEL WHEN THE GROUP FREQUENCIES ARE EQUAL 


урин bon an equal number of observations, т, in each group, namely, 5, 10, 15 and 
m eer ively, will be considered first. The values obtained for the actual tail areas cut 
(йө aoe the nominal 5 % and 1% significance levels of Р are given in Table 1. 
тя п а {де case of equal variances is also included to give an indication of the accuracy 
заа ia hod. T he values obtained from approximating to the distribution of x by the three 
Vic oia are in fairly close agreement, the Pearson curve, where used, nearly always 
: ga result closer to that of the Sy than to that of the Edgeworth series. There does not 
Ppear to be any change in the error of the significance level (which in every case is small) as 


the sample size increases. 
Table 1. Equal groups 


а) 5% nominal significance level 
о 


Unequal variances 
à Curvo Equal 
t fitted variances 
Set A Set B Set C Set D Set E Set F 
prm — — 
5 Edgeworth o051 | 0056 | 0056 | 0054 | 0063 | 0-060 | 0-062 
So -052 -058 :058 :055 -069 064 066 
Туре ІУ -- — = = _ 1068 — = 
а= ~ 
10 Edgoworth 0.053 | 0-059 | 0058 | 0-056 | 0-064 | 0062 | 0:063 
Sy 050 -055 -055 -052 063 :059 :061 
Туро ІУ 050 — — — -063 = E 
5 Edgewogth oo53 | оов | 0-057 | 0056 | 0-062 | 0061 | 0061 
Sy ‚ -049 -054 -054 052 +062 +058 :059 
Туре ІУ -- — = == :062 -- == 
ии ком Е РЕСІ _ 
20 Ed, 057 | 0057 | 0055 | 0-060 | 0-059 М 
Zdgeworth 0-053 0:05 0-060 
ig 48 | Фоа | фи | -052 | 061 | 0% | 459 
Type IV .049* — = = :059 = - 


* Value obtained from Pearson type у curve as the Ду Pa point for x lay close to the type V line. 


(b) 196 nominal significance level 


Unequal variances 
n Curve Equal 
fitted | variances | над | бов | 566 | 8р | SetE | бег 
= 5 g 0-010 0-013 0-012 0-011 0-018 0-015 0-016 
10 ue ‘010 013 -013 011 018 015 016 
15 кщ 010 019 019 011 017 014 015 
20 S; -009 012 019 011 016 014 015 
9 
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3. EFFECT ON SIGNIFICANCE LEVEL WHEN THE GROUP FREQUENCIES ARE UNEQUAL 


Welch (1937) considered the effect of unequal variances on the t-test for the difference E 
two population means. He found that for n, = n, = 10 and a nominal significance level о 
5%, the true level lay approximately between 5 and 6-5 % for all values of къ; [Kage о 
т; = 5 and п, = 15, however, he found that the test could be seriously misleading, anc 
showed that the significance of the difference between the two means would tend to be 
underestimated when къ; < ка, and overestimated when ко! > Көз for a given nominal level 
of significance. This problem has also been considered by Gronow (1951, 1953). 

Table 2 shows that these conclusions can be generalized to the case of more than two 
groups. The sets of variances considered are set A and set D. The error in the significance 
level appears to depend only on the ratios of the numbers in the groups, 7,/N, and not on 
their absolute magnitudes. For the error in the significance level to be a minimum, it 19 
necessary to take slightly more observations in the group with the high variance. An 
empirical rule seems to be that the ratios should lie about half-way between equality and 
theratios of the standard deviations in the groups, for the cases considered here. A similarity 
may be noted between this condition and that in stratified sampling, where, to obtain & 
linear estimator of a population parameter with minimum variance, it is necessary to take 


à sample from each stratum proportional in size to the standard error and the size of the 
Stratum. 


4. ЕРЕЕСТ OF UNEQUAL VARIANCES ON THE POWER OF THE TEST 
The David-Johnson method will now be u 


sed to examine how inequality of variance affects 
the power of the test, that is, the probabil 


М ; ity that it will detect real differences between the 
population group means. Again, it is only possible to examine particular cases. For th? 


Standard case of *normal theory?, where Ку = о? for all t, the power of the test depends = 
(а) v, = 1—1 and и, = N —!, the between- and within-group degrees of freedom, and (b) 0? 
| = св ыш аи has been termed the non-central parameter, measuring the дерге? 
rogeneity among the i баер 
ie = > t Xn Md ow Ifthe population group means are ку ( = 1,2, - 
D D АК 
then Tang (1938), in preparing his tables, 


used as the non-central parameter, 


b= (Ert (оу + ууу, e 


Е 5 d with the standard conditi ithin-gro"P 
variances Ky, differ, severa] forms of comparison may be of i йн ш 
below. y be ot interest 


D, i.e. with к, as (1, 1, 1, 3), and with these we 
quencies 7, shown in Table 4 (with En, = 40) е9 


Case (i): €, = €, = 0, but д: к 

А: =. 2 = ©з = C, but О, different, i.e, there is а, divergent mean in a group with Jow 
Case (ii): О, = С, = C, but О, 

variance, Г 


diff i i Ww) 
erent, i.e. the divergent mean is in the group ith wie 
The first step in the investi 


рр gation is to со jde? 
close approximations to the қ nfirm that the David-Johnson method provid 3 


Ower when 
the ka are equal. A comparison is made in Table 


* 
for 
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Table 2. Unequal groups 


(1) Set A (крв: 1, 1, 1, 2). (Tail areas estimated from Sy curves) 


131 


Nominal significance level 
n, Ng пу та En 
t 

5 96 1% 

8 8 8 16 40 0-032 0-006 
10 10 10 10 40 -055 013 
п 11 12 6 40 -077 020 
16 16 16 32 80 0-030 0-005 
20 20 20 20 80 -054 012 
22 23 23 12 80 076 020 

(2) Set D (ky’s:1, 1, 1,3). Ут = 40 
Nominal significance level 

т Ng ту па 5% 1% 

Edgeworth Sy Type IV Sy 
7 7 7 19 0-021 0-017 0-016 0-002 
9 9 9 13 049 043 ек 004 
9 9 10 12 054 049 = 012 
9 10 10 11 0-059 0-056 = 0-015 
10 10 10 10 -064 -063 -063 018 
12 12 12 4 -103* +132 +134 -052 


The tail ordinates of the Edgeworth © 
Part of the range but for the term in К 


urve ha 


xX). 


da‘hump’ in this case and would have bei 


(3) Set D (корз: 1,1,1,3). Ут = 80. (Там areas estimated from Sy curves) 


Nominal significance level 
т та тз та 
5% 1% 
“013 0-002 
13 14 40 0 
У 17 17 29 026 005 
18 18 18 26 042 499 
19 19 20 92 :053 5 
0 91 0-057 0-015 
i 54 a 20 -061 016 
2 4131 048 


еп negative 
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which is clearly satisfactory, and gives some confidence in using the method when the 
variances are unequal. In the further work, the Edgeworth series has been used, except 
where indicated by notes below Table 4. 


Table 3. Equal variances. Y, n, = 40. Comparison of approximations to true 
1 


significance level and power 


| Power 
Significance level | РЕ - 
| ф=1 ф=? | ф=25 
| Е 
True level 0-050 Tang 0-325 0-904 0-087 
Edgeworth series —-053 Edgeworth series +332 901 :986 
Sy :050 Type ІП curve -329 -903 -986 
Type IV -050 Log-normal curve | +328 | == zx 


We have now to consider what comparison may be of interest in illustrating the effoct of 
unequal variances. In the first place, we may compare the power computed as described 
for cases (i) and (ii) with that obtained if Tang’s tables or the Pearson-Hartley charts are 


tered with 
pte b= фи = (Со +), ~ 


where 0$, is the common low variance of the first three groups (taken as unity in the calcula- 
tions). In this way we can judge how, in the cases illustrated, the unrecognized presence Я 


& single high group-variance will reduce the chance of detecting a divergent mean value: 
using the ordinary method of analysis. 


An alternative approach is to replace g? 


in equation (3) by к) = У тко, i.e. to => 
the power function table with , 


9 = ф = (Ок, (v, +1) m 


For small differences among the « 


5 it would be ex ; ould 
provide a good approximation to " expected that the use of к in (5) WOY, 


in зы power. If the power function is to be used: in 
А of within-group variability der; + experienc? 
to explore in advance the size of samples required to ne eae pi aah SEE 

1 


may well approximat 
example we have taken with the Ky having vem (1 1 (гє 


would be expected under these conditions 
Table 4 gives the appropri ko 

priate comparisons. F o 

for conveni E 8. For both cases (i ii ге cho? 
Xn ane 42. iin the ¢, of equation (4) assume exact sa ae i pert Hp give? 

и n “2. 

+ m B H У j 
(i) and (ii) within a cell of the table “rs шь ев а jo ы 


will alter wi ; 
tiis н - the group frequencies in passing 99 


tical experiment wor 
he average variance K2. 
› 1,3) is perhaps more extreme 


he 
tha” 


А thi? 
Pproximations), we see that wit 
case (ii) than for case (i), apart А тё 
or а worth-while power: WE раб 
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been seen already in connexion with Table 2 (2), when т, is well above ог below the average 
Eroup frequency of 10, the significance level is seriously affected for this case D. | 
Жы now comparisons with the power 2(ф) obtained from Tang’s tables, based on 
they E $ = фу. We find again, as we should expect, that for a given set of C, an increase in 
E ы riances of one group lowers the power of the test. Except in the case where ф, = land 
e ‚ате (12, 12, 12, 4), where the actual significance level is much higher than the nominal 
Уо, the power for cases (i) and (ii) is always lower, and generally much lower than blo = Ф,). 


Table 4. куга: 1. 1, 1,3. Ут = 40. Nominal significance level 5 % 


Real Approximate power based on Edgeworth series 
t $ 
™ | significance | - --- 
ov. ing* 
evel using ф=ф=1 ф=дф == ф-фі-?5 
1 шарқы | а = с m 
5 7E 0-021 | Approx. power (i) Approx. power (i) 0-576 | Approx. power (i) 0-835 
Ball s i 0-017 (а) А (шу 541 (ii) 0-768 
4 ype IV 0-016 =¢,=1) i E 0-987 
19 As = 5 = 0-716) фа = 1-432) 0-61 0-82 
i 9 Е 0:054 | Approx. power (i) 0-211 | Approx. power (i) 0-724 | Approx. power (i) 0-918 
3| 2 | Se 0-049 (ii) 0201 (ii) 0-644 (ii) 0-830 
3) 10 Typerv — |26-Ф-1) 0:325 0-904 | 0 = d.7 2) 0987 
12 Жр =ф= 0790) 021 0-71 | 6 =Ф = 1-976) 0-89 
es | ы РА 
l| 19 5 c gc Э i 76 FOX. ИТ 
Е i ox: „їй . Approx. power (i) 0-764 Approx. power (i) 0:936 
3 19 8, goea Approx. power an ый (ii) 0.672 ees (ii) 0-846 
Type IV 0-06: =ф=1 вд == 0: =ф= 255 "987 
4 10 : pes 5% = 5 - 6816) AO = фа = 1633) 074 В(® = фә = 2041) 0:91 
D = 
1 | 19 pu өк 9 i) o c i 7 
2H Я A А ji Approx. power (i) 0 880 | Approx. power (i) 0:979 
3 | 12 S, iud | appen seem (i pprox. port? (ii) 0759 (ii) 0-892 
12 | Type IV 0:134 | BO = 41 =D) 2 0-904 0-987 
ae Ба ae = A = 0913) 0-84 0-96 
=. ш 
real significance level have been used: Е, with the Edgeworth 


* 
Thre imati 1 
Series. S e methods of approximating to the Б Я ve. 
3 Sy, wit! а (10 . type IV, with the Pearson curve. | reda 
~ Only А Johnnon, 8 (1949) muon (A calculating the power, except in the cases marked (i); (1), (8): 
geworth series was in addition to the Edgeworth series to calculate the power here, 


0 Cury T 
t es and the log-normal ге were изе ipo У 
ће Values алын ои [Ij Be ‘0-228, (8); Su 0:245, (ii)s log-normal 0-354. 


ade taking ф = фу Here in a number of 
d case (ii). For low values of the power, 
у a wrong start to the curve 


arisons are m: 


However, more instructive comp 1 
ase (i) ап 


Cases /( ф 3 Р 

= фә) lies between the power for ¢ 1 
бау less than 0-50, its values may be considerably influenced b. wre : 
т. = 0,10 by the difference between the actual and nominal significance levels. But it 


will be seen that as the power becomes large 50 that there is à worth-while chance of estab- 
IShing the significance of differences in means, the tabular power based on the weighted 
Mean variance gives a very reasonable approximation to the true power, particularly when 
© diver 31 опр with lower variance. | 
Comp М... " P oui the power curves as in Figs. 1 and 2 using ¢ = ф as 
abscissa. The curves for the frequency combination (9, 9, 10, 12) have not been drawn in as 
ey fall very close to those for equal groups, (10, 10, 10, 10). The standard curve, with 


Ordinate #(ф), is shown in poth figures by а broken line. 
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one ) (Fig. 1), the curve for frequencies (10, 10, 10, 10) is very close to the standard. 
"him “о n: (7; 1,1, 19) апа (12, 12, 12, 4) lie, respectively, below and above the other two 
ee vit 9, owing to the different values of the significance levels. Allowing for this, 
Қ ы we see in the different slopes of the curves the advantage of having more observa- 
ns in the group with high variance. 

ES. ve (ii) (Fig. 2), the curve for equal frequencies falls away from and below the 
oie ard curve as ¢ increases. The curve for frequencies (7, 7, 7, 19) starts at ф = 0 with an 
nate of 0-02 instead of 0-05, but has passed above the equal frequency curve when ¢ = 2, 

(OR у нн рар! ard. On the other hand, the curve for frequencies 
= "^ it 4), in spite ofits high start at 0- Зов g= 0, shows the lowest power when ¢ = 2. 
шін ifficult to summarize these results in any simple form, but perhaps the following is 
= in atement of the position. In general, where there is no very clear information as to 
ЖА t. heterogeneity in group variances 18 apportioned, it will be best to work with equal 
in Fd requencies. If, however, there are definite grounds for believing that the observations 
the E or more groups have a variance above the average, we must avoid taking less than 
574 eon number of observations from these groups. This is because we do not want to 
ти ~ risk of claiming a significant effect (when there are no real differences in means) 
vin erably more often than has been allowed for on the basis of the significance level 
а. en. Tt will, indeed, be worth while taking a few more observations from the groups whose 
ariance we believe to be above average, and there is no great danger of overdoing this. For 
E nas can be no objection to the risk of wrongly claiming significance being less than we have 
Owed for (e.g. for the power curve to be below the standard near ¢ = 0), if the true curve 


m Н 
akes a good recovery as Й(ф = Фа) increases above 0:50. 


t xh i 
hough it has not caught up with the stand 


5. ALTERNATIVE TEST PROCEDURES 


M Y 
Vhere the group variances are known, the statistic 


L- X o(ky- ћу): 


(6) 
ty of = 1,2, ...,/ group means, where w, = тујКољ 
= хо zy 

ted as y? with 1— 1 degrees of freedom when the 
xion, the L-test has been found useful in com- 
of probit lines (see, for example, Miller, Bliss 


m : 
Я ау be used to test for the homogener 
и = the ИВ group sample mean and ky, 


бі, may easily be shown that Lis distribu 

SA means are all equal. In another conne 

ing estimates of the constants О 
raun, 1939). 

hen the estimates of the group 

Қы tiong w, may be replaced by V à 

po L- > w(kyu— Ку), 


fa number 


re based on sufficiently large numbers of 
.,I) are the estimated 


(7) 


variances а 
= [ku where ky (t = 1, 2, - 


stributed approximately as y? with /- 1 degrees of 


be obtained from the non-central 


the above test will not be very trustworthy. James 
ance levels of x? depending on the estimates 
[-distribution to approximate to the 
ter test. 


wi > қ 
here ki = У Бу др Will then be di 
t t 


freedom, and the approximate power of the test may 


19 stribution, For small sample sizes the 
51) has devised corrections to the signific 
. Stoup variances, while Welch (1951) uses an 
‘tribution of L/. Both procedures should provide а bet 
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The fitting of Sy curves in this investigation was facilitated by formulae due to J. Draper 
(1952). 


The author's thanks are due to Prof. E. S. Pearson, Dr М. L. Johnson and Dr F. №. ы 
for suggesting the topic of this paper and for assistance in its preparation. Acknowled gemon 
is also made to the Chief Scientist, Ministry of Supply, for permission to publish this note. 
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1. INTRODUCTION 
ers from data obtained by the capture- 


The theory of estimating various population paramet 
(Leslie & Chitty, 1951; Leslie, 


recapture method has been discussed in two earlier papers ДӨ 4 у ^ 
1952), with special reference to the types of problem which may arisein applying this method 


ula ў was als , gexperi- 
to populations of small mammals. Tt lso shown, by means of an actual sampling expert 
Ment carried out on an artificial population of counters, that we should obtain satisfactory 
estimates of these parameters, together with their variances, from equations which are 


based mathematically on à deterministic model of the population, provided that the 
andom and that the size of the samples 


Sampling of the individual members is wholly at г 

is not too small. We now propose to examine in some detail the results of an experiment in 
the field, in which two populations of small rodents, namely, the field vole Microtus agrestis 
and the bank vole Clethrionomys glareolus, living on the same area, were sampled by means 
of a live-trapping technique over a period of very nearly two years, between May 1948 and 
April 1950. 

The most important questio 
analysis to any set of field data is whether th 
theoretical development were in any Way realize 
Slven for estimating the different parameters are ba: 
of the individuals was entirely at random, у 
Animals were being caught with equal facility. Iti 


n which immediately arises in the application of this form of 
e basic assumptions which are made in the 
d. Thus, all the equations which have been 
sedon the assumptions that the sampling 
s of marked and unmarked 


and that all classe: 
s necessary, therefore, in the first place 
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to examine a set of data with this point particularly in mind, for if there is a — 
breakdown in these assumptions, then no valid estimates can be made of the p Ны 
we wish to determine. The results presented here will show how dangerous it r% i d 
apply theoretical methods of estimation to unsuitable data, without some such 
iminary analysis. 

оте pai of the present paper is, then, to see whether any of the хаав 
estimation which have been developed оп a purely theoretical basis can be applied to a ве 
of field-data obtained by trapping а population of living animals. We shall not be — 
cerned either with the purposes of the original inquiry, or with the biological or ecologica 
interpretations which may ђе placed on any of the results. These, together with the pee 
why certain technical procedures were adopted in the field, we propose to discuss in greate: 
detail elsewhere. Е да 

Finally, a word as to nomenclature. It is evident that in trapping a population of w Я 
animals we are not obtaining a sample covering all the possible age-groups alive at a 
time. Thus, by this method of sampling, we can have no knowledge of the number of young 
in the nest, and it is not until these young grow up and become active that they gradually 
become part of the trappable population. It would be tedious and cumbersome to refer 
each time to the ‘population at risk of capture’, and it must be clearly understood that when, 
for the sake of simplicity, we use instead the term ‘total population’, or ‘the total number 
of individuals alive at time t’, we mean only those members of the population which are 
capable of entering the traps. In the case of a species with an intermittent breeding Beas) 
however, these members may constitute the entire population on the area at certain times 
of the year. Similarly, for the sake of brevity, we shall use ‘dilution-rate’ and ‘death-rate 


i i ; А : А ioration 
as comprehensive terms which would also include any rates of immigration and emigratio 
to and from the area. 


Since we shall have frequently to cross 
(Leslie & Chitty, 1951; Leslie, 1952) 
form І, 52 and II, $y respectively. 


. . . TS 
"refer to various sections of the two previous pape 
› these references will in future be given in the shorten 


east side of Lake Vyrnwy: 
pruce, which was about ] ft. 
ormed by the lake, a ваге 
m south-east to north-west- 
ntirely self-contained, the only open channe 
ither end, The mature-plantatio? 


owever, did Е е the 
trapping area, not prevent other Species from moving to and from 


The field methods were 
differences being that sa; 
baited for 48 hr, before h 
before, of drawing sam 
Kempson, 1949) 


i m to those used in a previous study (Chitty, 1952), the chief 
es "x Wate Were allotted at random and that traps were P He 
ples which и by these procedures to avoid the error, met W! 


| biased by an excess of marked animals (Chitty > 
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baiting points were determined by means of a table of random numbers, each section being 
sampled in proportion to its area. From May to July 1948 positions were chosen at random 
within sections, all of which were trapped simultaneously, though lightly, between 9 a.m. 
and dusk. On each of the following two or three days the traps were lifted and redistributed 
at sites randomly chosen and already prebaited. In September and October 1948 we still 


trapped on the same system except that the traps were set overnight. In November 1948, 


at the suggestion of Mr M. J. R. Healy, a system of stratified random sampling was adopted, 
one or two positions at random within each 
argin of an incomplete square no trap 
aken, no trap was set; 


the most usual method being to allot either 
10у4. square. Ifa point chanced to fall beyond the m 
Was prebaited, and at points on the area where no prebait was t 


wn at 20 yd. intervals. Microtus was absent 


. Sketch- T ing area with co-ordinates sho 
хан ОРО Raye side of the road are scrub, suitable for Clethrionomys. 


from all hate reas, but those on the lake 
рар Ахавч) ры Е hich Apodemus only is known to have occurred. The 


Fig. 1 


Other hateh н го woods in W 

ed areas are mature woods 1 Д À © өй. 
road is 15%. wide iih 3ft. grass verges which are included in the trapping area. B = blackthorn. 
at each point. The area was now trapped in 
etermined at random. From November 1948 
ited next morning, but from June 1949 


Otherwise either one or two traps were used 
Seoti ә А 
ections on different days, the order being а 


to May 1949 traps were set in the afternoon and vis 


hey were set during the morning and visited twice within the next 24 hr. 
3 ‘ious respects, only two of which need be men- 


This general plan was modified in var { 
tioned, (a) dar the first trapping in March and April 1949 the whole area was retrapped in 
a single operation, one trap being allotted at random within areas of 20 у4х10уа. (0) In 

ау and June 1949 we removed part of the Microtus population which ranged over the 


area 5 d horn in sections 0 and 1 (Fig. 1). 
E e toi ces a A ра nbered leg rings, except in the first summer, 
als were mark 


f nur 
y means 0 vios аиа 
When a clipping or ear-marking was adopted. In going through the maivicu 
records T for believing that any errors were caused by s loss E 
i Е i le taken from the 
rings or other marks of identificatio f the sample take m 


n, except in the case о 
м ion i 1948. 
ehrionomys population in September Ad А. 


Som ; " ffort may be gathered fr Р 
area, Pie da ledio It will be seen that the total number of traps entered, in- 
cover . 


ted. 1 than the number of occasions on 

° uding th but unoccupied, is always less 
10; rung but u А 

which bs а Е "А " - occurred, i.e. when allowance 18 made for the ne vers 

each ir e en reset. These facts indicate that at least one point 1s пхегу 


i іст f at least 
fa ithi imal (individual Microtus ranged over areas ot at lea 
10 len within the range of t с в for traps Was not so severe that the activities 
of сайы, ed serious error into the sampling of another. 
when the traps were lifted and reset 


140 The estimation of population parameters from capture-recapture data 


N 


“лау SEENON EARL 
X OY SONSS WAS, 


289 
we | 8 | LI 


En) porum үздод, \ 
(s) рәзүхгш 18101, 


~ 
= 
- 


"AON 
"удов 

Amp 

ouny 

Керү 

чау 

"ep :6761 
"AON, 

420 
49$ | 
Атар | 
eunf :8Р61 | 


чау "лом | 3458 


Ame | 


ounf 


Avy 


(2) опадао jo yQuoyy 


"лох 00 | "здов 


| 
OYA quor 


(paurquos 55 pun 20) '("ш) po«njdvo 180] әләт hay) әәиле уралојиг IYI ој Burpsovon зат тол вт\уолотуү fo 40704144840: ‘с 9141, 


"0961 tidy. ш GE #8761 1940490 ш 09 :398 эхом Хоца Авр OY uo рәўївгл ospe олом 84913 ourog 4 


"роптшехо иоца A3duro зпа ‘Sunads sdear, w 


synsas Билл) fo hammung "T әче, 


| 

1 + 89& 979 £cl T6 т | £I FI 85 95 &ió чау yc-8I 10961 

1 с £18 6/9 688 £c 1g LI 05 se 061 4% “AON 12-91 

1 1 | 10? 975 9/5 Ра T £c GI oF 161 ae 9995 g—Sny 6c 

t | i 266 тсс erc 85 61 99 о 6 86 T Атар 75-05 

1 1 cog Hu 681 те 5 $6 5 D LII 22 ounf 06-91 

1 0 86& 055 LLI 85 TI LI £ 15 66 0-1 Авр 85-05 

1 0 866 506 81б $5 08 61 $ 75 есі el "dy 95-85 | 

I 0 vic 206 #05 РТ ге 0c 9 | 96 901 ТФ зер 80-06 :6961 | 

I 0 $05 305 сөт ее 0 81 £ |^ c6 19 $4 злом дс-Ре | 

1 + 081 081 961 0& 05 61 е 59 69 91 790 91—&1 | 

1 | 0 958 958 095 се УР £I т T9 св Bl "2498 9-1 | 

0 | T Сбт S61 £cl el ТІ СІ 0 [34 oF е1 Атар 08-95 

0 5 955 985 185 0 IF 65 0 те 101 ounf 61-91 :8761 
— Е - = ы Ms 

| 

Яш хвпр 0} впшәр shou | 
Ж. б saolyg wes 3 зтоло И | (s&ep 8с 
woj | WOON ТЕТЯ pewwqoid | poaojuo | ,Sunads ш офу | -02.04791.) jo spun) | 

————| "jo on | sdvaq Jo sdun ИЕ s3urddea} воувр Surdduag, | 
sou : 1 | 

Фолу yous 0} ем ое | поло poddvay spenpratpur Jo ‘ON E 

вуна JO "ON. 


Р. Н. LESLIE, DENNIS CHITTY AND HELEN CHITTY 141 


at othe: i z Ч 

асын ч on successive days in the summer of 1948 and in March and April 1949. At 

Е ве > the number of these recaptures was minimized by putting the animals first 
o tins (containing a supply of food and bedding) until the traps had been reset 


elsewhere. In assembling the da i i 
ye field trip naa not аш” на о аншы 
Пенья ета Table 1 that the majority of the animals trapped consisted of Microtus, 
зе : hs ý 5 ionom ys and the shrews, Sorex araneus, S. minutus and Neomys fodiens, 
mostly m = TS found at points scattered all over the area. The shrews, however, were 
Ше deae = in the traps and few were released alive. The wood-mouse, Apodemus 
йв hien] | А ively wide-ranging species, occurred in small numbers as an invader from 
al habitat outside the area. The only data suitable for analysis were thus obtained 


from =: 
the Microtus and Clethrionomys populations. 


3. STATISTICAL АХ ALYSIS 


T А 

а took place оп 11-14 May 1948 and the last on 18-24 April 1950, but since 

taken ЊУ he area was omitted on the first occasion, the origin of the sampling chain was 

Unequal j ne purpose of this analysis at 16-19 June 1948. The samples were taken at 
al intervals of time, and although this in no way affects the methods of estimation 


whic} В 

құ die 5 ; 

me us be employed, it is necessary to have some approximate and simple time scale. 

а a а . H +. ? 

i i à parameter such as the survival factor, determined for a particular interval, may 
Xpressed in some unit of t Working in units of 28 days 


ftom, ime for comparative purposes. 
m an origin at 16-19 June 1948, the scale in the second column of Table 1 was found to 
espond satisfactorily with the true da 


айг tes at which the samples were taken. In addition, 
ely arbitrary scale is a great convenience 11 


hatin order to simplify the equations for esti- 
і Ја 5 the various parameters and the actual computational procedure. Thus, commencing 
1 Q М " 
ле 1948, the trappings were carried out at to ty, t2- 


тігі 4 уау and we will label the successive 
= > es with these suffixes, so that the June sample is regarded as having been taken at 
» the July sample at ¢ = 1, and so forth, the chain en 


} ding with the sample taken in 

1950, at = 12. 
ecause a number of animals died in the traps or were purposely removed from the area, 
ds given in the previous two papers 


it is 
in а атр to adapt some of the equations and metho 
rder to allow for these losses.* The corrections which must be made are, however, 


relat; К Poen 5 
latively simple, and the same symbolism with slight modifications can be used as in the 
тег. Thus, the results of each trapping will be based on 


leoret; 

в retical development given ez 

numbers of animals captured, either alive or dead, when the traps came to be examined. 
captured at time t, 


L 
et C, = the number of individuals 
oved from the population, 


ad in the traps or rem 
e, so that C,-d, = = 

be divided into the num 
ћ had been. captured 


indicated by а superscript to the actual 
30 animals were captured of which 4 were 


d, = the number de 


an s 
d В, = the number released aliv 


The б 


sa ; individuals captured 
lously captured, and the num 


ber и, which had not been 


at time t сап 
at least once. Throughout 


ber s, whic 


ж 

hex all the tables given іп this paper, these losses are 

Sithe er caught. Thus, an entry such аз 30% is to be read as: 
т found dead in the traps ог removed from the population. 
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the present work, except in one instance, these recaptures were grouped according to the 
interval of time since they were last captured (Method B: I, $4). Thus, if 
m = the number of animals captured at time ¢ which were last captured at time 2 
м 


t-i 
4 
we have Ута = 85 and 8+ = С. 


We will also define | 


yr, = the total number of animals captured at least once which are alive in the popula- 
tion as a whole at time t, 
and putting Yi = = h, 
then, immediately after the release of the R, living individuals at time /, the total number of 
this class in the population will be y, + у. Then, if 
Ћ = the survival factor over the interval of time Е to t+ 1, 
we expect Р(Џ +) of these to be alive at the time of the next trapping. Similarly, if 
№ = the total number of individuals alive in the population at the trapping at time l, 
N,— d, will be alive at the time of release, of which P(N,—d,) will be alive at time t+ 1. This 


is, of course, to adopt as before a purely deterministic model for the population. Lastly; 
we put 


В, = the dilution factor over the interval t to t+ 1, so that the number of new entries 
alive at time ¢+1is B(N,— dj). 
Thus 


Ма = (Р,+ B) (N,— dj) 
and 


à= Мы(®—4,) = PB. 


3 Ma. 
up to be trapped on 20-2 «ве 


Li 
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Microtus population should have been decreasing in numbers merely owing to the death- 
d and any dilution factors which can be estimated during this period should therefore 
е Zero. 


(i) Preliminary analysis 

À preliminary analysis of the combined male and female population was therefore first 
carried out by a method of estimation which is very similar to that described in IT, $55. 
We there considered a table of та values when the data for a long chain of samples are 
grouped by Method B (1, 54), and for the purposes of rough estimation we utilized only the 
last two entries in each column of the table, and determined the successive values of Р, and 
№ from a series of overlapping triangles of т entries. Although this method is satisfactory 
When relatively large numbers of animals are recaptured in the various classes, a slight 
difficulty was met with in applying it to some of the data analysed here, owing to the fact 
that in some tables certain of the т entries for « = t— 2 were zero, while one or more of the 
entries in the same column for a <t—2 still remained > 0 (e.g. the sample for October 1948 


in Table 2), Under these conditions unsatisfactory estimates of N, and А will be obtained, 


àndamodified method of estimation was therefore adopted, which is described inan appendix 


to this paper. 
In principle this method is similar to that described in II, $56, but at the same time it 


Utilizes more of the information contained in the original table. From the latter we form 
& new table by retaining each т, entry and then summing each column of m,, over the 
ы > Ја А 
values оға = 0 tow = t— 2. Thus, for each value об! > 1, we have the pair of observed values 


i-a 
тыр n = Уш (m+ mpi, = 8). 
== 


animals captured (Cj) and released (2), together 


Then from these entries and the number of 5 
the maximum likelihood 


With the value of y, = u- = R,—5, we first of all calculate 


estimat 

Н j, = = = 1 T Hos, 891 
Tu t+1 

s are given by 


Then, the successive survival factor 
(t = 0,1,2, ...Т-2), 


В = ТІСТЕР) 


ча fcf GRAS EN — 
Instead of this estimate of М, which is known to be positively biased (II, $ 1), we may also 


Use the type of adjusted estimate 3 
Ñ, = $40 DI D 
minary analysis of the present set of data. 


ugh, preli 
ет ulated, which may be done very quickly, 


though we have not done so іп 6 
have been cale 


ae these estimates of В and № 
€ values of A, follow from à =N; 4210—00), 
і liminary analysis of this nature it 
and h anb subtraction. In a pre А 5 
P | those of В; id E s a calculate the sampling variances of these estimates of 
probably mot Бе neos е also given in the appendix. 


p ssions for doing this are als 
The Аар ню methods to the data in Table 2 are presented in Table 3, 


Where the estimates of Yr Ne ^ P, and В, are entered opposite the months of the year in 
stimates p Np Ae 


matio of populat пот, рата 7 fror те- ture data 
44 Th гута 207) meters т, captur recap 
1 ее 


i i thers given later 
ding this table, as well as о g 
i ive samples were taken. In rea е 
bere the = | три be aed that the estimates À, P, and В, зе to ae желе рете 
то f the population at the ) 
а №, refer to the state о pen 
vis dam impr in Tal P, = 0-424 18 the estimated survival factor 5 
taken. Thus, in Table 3, P, = 2 está ee гы. 
idees he June and July samplings, and similarly B, — 0 661 is e тір 
сати ыласын rictly speaking, i/, and № сап 
factor between those of July and September. Although, strictly T al Е у | pete 
е i simal plac 
it i reni о retain at least one decimal | 
integral values, it is convenient to re | | EL 
.. figures which are estimates from those which are known, true values 


1 rs for Microtus from data in Table 2 
Table 3. Estimates of population parameters Jor Microtus fro 


n l x b 
Month t Jy. | Ñ, à | Ê | t 
| т ay 
| = - 0-424 — 
А M | - 
~ Tu 1 40-10 | 1596 1153 | 0-492 | ое 
8 2 34-26 171-8 1:343 0-759 195 
ne 3 7538 | 3281 | 1373 | 0159 0:551 
Nav 4 84-00 2814 | 0719 | 0413 0506 
1949: Mar 5 | 5287 200-2 1-221 | 086 | 0:365 
“Жр 6 | 110-38 949.1 0-927 0626 | 0301 
May 7 | 109-21 220-6 0-646 0-444 0-207 
June 8 66-30 136-1 1-077 0580 | 0.547 
July 9 54-18 1207 | 1495 0817 | 0618 
Sept. 10 8593 | 1760 | 1:080 0543 061 
Хоу. 11 82-00 | 190-0 - — | 


"D 


Р "tant. 
For our present purposes, it is the estimates of B, given in Table 3 which are impor pers 4 
It will be seen that the individual figures remain positive throughout the chain, e of 
we should have expected to obtain, between October 1948 and April 1949, қайны, 
Ву B, and B; That these estimates remain ее 
і ed random sampling of both marked 4 
unmarked animals in the population was not i 


„y t0 
in fact fulfilled, and it is therefore necessary 
examine the results for the winter months in greater detail, 


A method of testing for the absence of dilution in the population sampled has been e 
in a previous paper (II, $ 5а). The method was there described in terms of a populat^ 
which was decreasing over the whole sampling chain through the operation of à Жа 
death-emigration rate, this tate being assumed to fall equally on all subclasses of mat 


Р 1 ere 
iduals in the population. It was also assumed that no deaths W 
od of sampling. In th 


othesis that there wa; 
- These complications 


number of accidenta] deaths is small, and 


ng of thi 
of the o 
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E therefore considered only in terms of these adults. The recaptures in the samples for 
ctober, November, March, April and May were then grouped according to the time they 
were first captured and marked, and males and females were treated separately. The data 


for the two sexes are given in Table 4. 


Table 4. Distribution of Microtus recaptures October 1948—May 1949 according to 
the month they were first marked and released (adult animals only in May) 


Е 1 
| Month of capture 
| 

Month when | | | Total 
first marked Oct. Nov. | March | April May recaptures 

| | ! = 

Males 
Scam E "тат | 
June-September 14 8 7 4 2 35 
October — Бі 4 2 i 12 
November — = 3 2 4 9 
March == — — 141 7 21 
April — — — к 12 12 
No. unmarked 254 29: | 38: 34* 21 
Total catch 39 4 | m 56% 47 
Females 
June-September 9 6 5 | 8 1 29 
October Р == 1 5 | 3 з 12 
November — = 4 1 B 14 
March = | a = ш 1 25 
April — | aut 40 34 16 
No. unmarked E | 18 | E | 
| ------- —— en — ж.” Гг” d | y 
" 1 1 

Total catch 80: | 25 | 54 | 69 39 | 


ct for the type of population we are dealing with, we 
ked animals recaptured from each batch of releases 
tch in each sample (II, $5a). Thus, neg- 
idental deaths, we may calculate the 
e table from the mean proportion re- 
s, originally marked between June and 
ting of à total of 236 animals, a mean 


Now, if our assumptions are corre 
should expect that the number of mar 
Would form a constant proportion of the total ca 
lecting any effects from the small number of acc 
expected number of recaptures for each cell of thi 
Captured in each row. For example, а total of 35 male 


“ptem d in the five samples consis c : 
Stroke te eem тетра ой” number of this class caught in October is therefore 


0148 with the observed number of 14. In a similar fashion, the 
3 x 39=5-78, compared ОҒ recaptures can be calculated, the expected number of 


Temai 
анча. ee їе 4 finally by subtraction. The observed and pcm 
t +. 
Pequencies күнү Б be used to calculate x? in the po ү ү, number = ‚м p 
isti f c columns, Since, owever, the ; 
Sedom being 2с(с— 1) for a table consisting 0: 
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numbers for the individual cells of the present tables are small, we have compared the „~ 
for marked and unmarked animals. The following were the results for the two sexes, the 
expected numbers in each case being given in brackets after the observed frequencies: 


4 
| Males Females 
| Sample | 4 | = a 
| taken i | т 
Poor Marked | Unmarked Total Marked Unmarked | Total 

2 = | - T" | ee 
1948: Oct. 14 (578) | 95 (33-22) | 39 9 (401) | 21 (25-99) | 30 
| Nov. 13 (8-79) 29 (33-21) 42 7 (4:95) | 18 (20-05) E 
| 1949: Mar. > 14 (13-90) 38 (38-10) 52 14 (15-35) | 40 (38-65) | 60 
Арг. 22 (26:38) 34 (29-62) 56 35 (34-05) | 34 (34:05) | 34 
Мау 26 (34-14) | 21 (12-86) | 47 | 23 (28-75) | 16 (10-25) Шы 


For the males y? = 24-7, and for the females v= 


12-8; and since there are 4 degrees of freedom 
in each case, Р < 0-001 and < 0-02 


‚ > 0-01, respectively. Overall, therefore, there is a signi- 
ficant departure from expectation, and in both sexes the deviations show that in the earlier 
samples more marked animals were being captured than expected. а 
These results, together with the non-zero values of the winter dilution factors В, which 
have been calculated earlier, show that an apparent dilution of the Microtus population 
had been taking place during the winter of 1948-9, at a time of the year when no new 
individuals should have been entering the population at risk of capture. Judging from ou" 
previous experience of trapping this species, the most probable explanation of this pheno- 
menon is that marked and unmarked animals were not being caught with equal паг 
though the reasons for this difference in behaviour are at present obscure, For our presen 
purposes, however, the most important conclusion to be drawn from the presence of this 
apparent dilution is that t] ade in the mathematical model are not valid 


i t 
for this popul present during the winter months, we canno 
ay also have b 


(ii) Data for the marked individuals 

The results so far obtai 
important question. Tf th 
not the same, is this diffe 


ned from the ana] 
€ chances of са 
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10 врріу any methods of estimation based on the assumption that the individuals were 
5 randomly sampled like counters іп a drum. 
а = way in which we attempted to answer this question was by considering separately 
- e individuals which were captured, marked for the first time, and released alive 
ween September 1948 and May 1949, only adults being included in this last month. This 
7 oup of releases was chosen merely because it seemed likely that they would have a more 
oe age constitution than that of groups released during the breeding season. 
ee ~ falling into this group, which were released at each of the trappings in October, 
ra > er, March, April and May, were then tabulated according to the number of times 
ж еы Бееп trapped. Since two trappings were carried out in both March and April, an 
eo. id ual animal could have been released in, for instance, March, having been trapped 
пов in that month, apart from its previous history. 
. Then, if a total of т, animals are released at time t, of which a; have been trapped i times 
(бее 1,28, Ya, = ту), and if a total of с, of these are recaptured at time #+ 1, we should 


›&, 


i 
expect, under a hypothesis of a purely random sampling, to find that асл? would be 
al mortality 


Tecovered out of the а, originally released, provided that there was no differenti 
etween the various classes. The numbers of individuals released each month are given in 
ding to the number of times they had been 


Table 5. he distribution of Microtus released accor 
he next trapping of the area. (88 and 99 


trapped, and the number of these recaptured at 1 


А combined : adult animals only in May) 

ШЕ | в | "ГІ! | | | ru 

limes Vies © Re- Caught Be Caught | Re- Caught | 2: Caught 

n 8 lease ‘aught aught || 4 g leased : leased | 

Tapped | (2800 | Мом. lensed | "Mar. | eased | Арт. ive May | way | ше 

1 | 4 " | | 
: | | | 5 5 20 (16: 5 | 140133 
2 4l | 6(61) | 45 7 (8-4 63 24 (28:6) 57 20 (16-3) | 35 | 14 (13:3) 
3 13 209) | 12 3 (2:3) 25 | 16 (114) 33 | 6 (94) 39 | 18013) 
1 = - 2 1 (0-4) 9 5 (41) 16 | 5 (46) | 5 | 1 (1-9) 
5 — == es ao 1 о (04) | 11 | 3 (8-1) | 5 1 (1:9) 
: - = = == 1 о (04 | 2 о (06 | 2 | 0 (08) 
За айба - = |7 Me == 

a 5 ` 29 (29-0 
otal | s | (eo) | 59 | мато| 99 | 4069) | 19 34 (340) | 76 (29-0) 


Expected number of recaptures given in brackets. 


ber of these recaptured at the next trapping of the area, the 
tribution at the time of release, being given 


brackets. Comparing the observed and e7 ted frequencies, it can be clearly seen that 
ere was no obvious tendency for animals which had been trapped a number of times to 
26 caught more easily at the next retrapping. It is to be noted, however, that the ane 
Interval between the trappings is 28 days, and it does not necessarily follow that we shoul 
ave obtained the same result if the intervals had been very much shorter. — 

This evidence, which is satisfactory so far as it goes, is = ы ne йа 
“tilizing only a portion of the data over a limited period of time. n is z =. me 2 
of approach, which is of much greater interest. Suppose we керінді м е ie er 4 ee 
Which were captured unmarked, namely, the u, figures m Table 2, an ES Ды ош И ++ E 
to the marked animals which were captured at each of twelve trappings between July 


and April 1950, inclusive. These represen ain of twelve random samples from the total 
, inclusive. 


Table 5, together with the num 


Expected numbers of the latter, based on the dis 
xpec 


фа chi 
10-2 
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marked population on the area, and we will now regard this subpopulation as + 
only one with which we are concerned. Then, from the information yielded by these — e 
we can, in theory, estimate the total number of individuals in this population, and ~ 
survival and dilution factors within it for each interval of time. With reference to 1 d 
population we can use exactly the same symbolism as before, the total number of individ = 

at time t being N, from which a sample of C, has been withdrawn. Similarly, in subdividi е 
the sample of C, into the various categories, all those animals which had been caught on 
once previously are regarded as new members (corresponding to the unmarked (з) class 1n 


Table 6. Distribution of Microtus re-recaptures according to the interval since they were 
last recaptured (т). Marked population only : 33 and 99 combined 


Month of capture (t) 
Month when 

last captured (a) ] ] | | г. | Ар” 
July | Sept. | Oct. Nov. | Mar. Арг. | May | June | July | Sept. | Nov. A 

| | | | — 

- si = - T | | ———|— M 

| | | | | ей 

1948; July = 3 | = йек — 

Sept. - - 6 1 = == = == | = — = A 

Oct. = = == 5 2 = -— = oe m — = 

Хоу. = = = = | 7T | gg = 1 == || = “1 A 

1949: Mar. | | | 15 з 1 — | — - 

Apr. | 132 9 | — | — = = 

Мау — — | = | = = | = = 16% 2 ЕА Е E 

June | | | ж — — 222 жегі — 

July s = = — fx | = an 2) = 23 4 | = 

Sept. = zs ||| = = s | fce == = es = 29) 9 

Nov. == == | | = == к 

Г ғ | I бенен | et ағы ps |“ 9 
8 - 3 | 6 6 9 | 1 16° | 275 | 24: | 23 35. | 10' 
us 12 | 14 | 17 | MB | 19 | 401 | 333 | 308 | 20: | 39 4T e 

= 1 " | | le 3*3 d | | s2 | 1 

‘otal cate А 12 iT | 98 20 28 | 57 49 57 44 62 | 52 1 
Total released (R,) 9 | т | 28 | 19 | 28 | 2 | 80 

t | 3 8 | 56 | 43 | 44 | 41 | 6 | 


the population as a whole), while all th 
are regarded as old members ( 
whole). These old members с 
represent the re 


need be considered, By di 
the amount of data is, of 


therefore pooled the data 
Table 6. 
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Now we know the actual number of animals which were marked for the first time and 
released at each trapping, and we can therefore compare the estimates of Z, which are 
made from the distribution of the re-recaptures, with the true values. Any marked dis- 
crepancy between these figures, when compared with the standard errors of the estimates, 
would indicate that the sampling of the marked population was not at random, in which 
case no valid estimates of any population parameter for Microtus could be made from this 
Particular set of capture-recapture data. 

The estimates of 2, may be very quickly obtained from the эл values given in Table 6. 
As a first step, we obtain, as before, 

-2 
т A та (+1 = 8). 
х=0 

Then, from the total number of animals ca 
(Ri), and the number of accidental deaths or removals ( 
figure y, = R,—s, we calculate first of all 


ptured in the sample (Cj), the number released 
4), together with the additional 


a Mok 
= Ма ава Или Sp 

Tua 
р = 5/06. ат 1-р, \ 


and from these, as shown in the appendix to this paper, 


А = (Инь бра (4-1,2. 
and, for the variance of this estimate, А 
‚у 5 ери 2 2 
EE sn DH 
; Pur Sia D mM, (44 Mm) & 
The results for the Microtus data are presented in Table 7, where the origin of the arbitrary 


time scale is now taken to be July 1948. The first estimate of Z, which can be calculated is 
for September 1948, the last for November 1949. The estimated total number of Microtus 
marked for the first лө and released into the population over this period is 609-4, whereas 


8, ves 1 =); 


data in Table 6, of the number of Microtus which were 


Ta i rom the Е 
о d and released at each trapping (2) 


marked for the first time 
| үйе | 2 True no. | Deviation | Standard 
Month t Vi 2% released A error of 2, 
2 CN ЕЕ 1: +18:5 
1948: Sept. 1 3-00 із Шо Де шар 
Maw, A 1213 66-8 76 2 93 fd 
"Hà 5 29 а 65 +46-8 +26-5 
Po 29-92 111-8 i65 Tr 
My 5 45-56 19.6 46 E £207 
pon 4 31:00 35:5 8 128 + 84 
e : 24-00 69-5 54 + £106 
4 78: 65 +13:8 + 15: 
i 35-83 18-8 сы +138 ina 
Nev. 35-00 88-0 + 
Nov. 10 е 
609-4 613 
Total | 
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the actual number released was 613, a remarkably good agreement. Comparing the E 
dividual Z, with the true values, it can be seen that on the whole they follow the ae 1 
the latter reasonably well, the worst discrepancies being in the months of April and ие. 
1949. In regard to the errors of these estimates, there is one point to be noted. 1 ће d 
given above for V(Zj) is derived from a set of maximum-likelihood equations which а à 
based on the assumption that the probability of obtaining the observed results at Fee 
sampling is given by the appropriate term of a multinomial distribution. In actual ie А 
we ате drawing samples from a finite population, and іп using this expression, therefore, 
we may tend to overestimate the standard errors of Z, if the proportion of the population 
sampled each time was at all large. 

Now, the true proportion sampled each time, namely, f, = € 1121, is unknown and can only 
be estimated from the data. Thus we have ву И as an estimate of fj, and from this it will be 
seen that no estimate can be made of Îr for the last sample in the chain, upon which e 
estimate M is in part based. Moreover, if n, = 0, which, it will be seen from Table 6 
occurs on three occasions in the present series, then V, = 8, and it would appear that the 
entire population at risk was sampled at time t, a conclusion which seems somewhat unlikely; 
though not impossible, in data of this type. The simplest way of proceeding is therefore to 
obtain a pooled estimate of f, from the available ул, and apply the average correction 1 А 
to each of the individual variances, Thus, from Tables 6 and 7 we have УУ) = 0-694 Ж. 
1—f, = 0-306, from which we have the adjusted standard errors of Z, given in Table = 
A comparison of these with the deviations of Z from the true values suggests that the e 
are of much the order we might expect. Actually Xs*(Z)) = 3007, compared with XA? = 435 ~ 
and considering the approximate nature of the correction term employed, this seems ? 


42. А 1 
sufficiently good agreement.* Thus, by treating the marked Microtus as a population from 


Р 5 Ё · that 
which we were drawing a series of samples, we have obtained estimates of a parameter ШШ 
are sufficiently close to the known, true values for us to conclude that the behaviour of t? 


Р : e 
part of the total population could be regarded as satisfactory, since there was no eviden? 
of any marked departure from the assumption 


-hese 
of a purely random sampling of thes 
individuals. + 


The results of this analysis of the data for the Microtus population on this partiou 
area can be summarized as follows: 


(1) There was a difference in the behaviour of marked and unmarked animals. A portio” 
of the population was being trapped less easily than the remainder, at any rate during the 
winter months, and we cannot exclude the possibility that the eum phenomenon wa "^ 
present during the remainder of the year. This being so, no estimates can be made 9 e 


(2) Once trapped, however, the Microtus 
from our present point of view, 


obtained from the marked population У jol i 
ght have expected if we had been rando! 


et 
trapping living animals. The data for um 
CAMCN адь dii e пе ers too large, owing to the occurrence of three zero V^ ave 
Л = 104/177 = 0-587, and the adjusted Ys2(Z,) = itive oe 
estimate of f, — 0-567 which is made in the i 5 s es 
marked individuals, The present estimates are Seiad 
subpopulation of marked animals which were "m 


* It is possible that this estimate of f, 


at least twice, 
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Microtus may therefore be used to estimate such parameters as the death-rate at different 
seasons of the year, or the expectation of further life of the individuals released at some 
given date. Such estimates, however, will be applicable, strictly speaking, only to the 
trappable portion of the Microtus population. 


(ili) Methods of calculating the death-rate and of constructing life-tables 


As an example of a simple method of estimating the death-rate per unit of time in the 
Population as a whole, we may use the estimates of the survival factors Р, given in Table 3. 
These estimates are based essentially on the numbers in the various classes of marked 
Individuals which were recaptured from the known number of animals released (e.g. II, 
8$ 1 and 5), and no use is made of the observed proportion of unmarked animals occurring 
In the sample of C, taken at time t. This last point is of importance since, if this was not the 
Case, the estimates of Р, would be just as untrustworthy as the estimates of total numbers 
for this particular Microtus population. The values of В given іп Table 3 refer, however, to 
Varying intervals of time, and for comparative purposes, therefore, it is necessary to express 
them in terms of some common unit. If we define 

14 = the force of mortality between ¢ and ¢+dt, which we assume to remain constant 


over the interval # to t+w, we have 


Bae, 
Then Jt, = — log, Руш, 
With Viu) = V(Pd|(wP?), 


the expression for calculating V. (Р) being given in the appendix to this paper. These vari- 
ances may, however, require correction in order to allow for the finite nature of the population 
Sampled, particularly if a relatively large proportion of the marked population was being 
Captured each time. This proportion сап only be estimated from the data, and, as before, 
We have employed the estimated average proportion f, sampled over the whole period. 

и) timates of the total number of marked Microtus in 


hus, in Table 3, the values of у, are es : ome 

he Population at time t. The sum of the available values of y; is 795, and from Table 2 the 
Sum of corresponding s, is 451. Hence, / = 0:567, and we have corrected the estimated 
Variances of Iu by multiplying them by the factor 0-433. The following were the values of 


4 expressed in a unit of time of 28 days, together with their standard errors: 


Death-rate | 

Month pits 
| 0-572 + 0-152 
їз: Тиш | 0-545 + 0-174 
ne 0-184 + 0-143 
ane | 0-217 + 0-154 
e | 0-216 + 0-039 
AS 0-129 + 0-089 
1949: Nar. 0-468 + 0-116 
Apt 0-812 + 0-087 
Мау- | 0-577 + 0-056 
а 0-135 + 0-041 
| = Мне. | 0-218 + 0-011 
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The death-rate per 28 days in the marked Microtus population, taking both sexes to- 
gether, was high during the summer of 1948. From September onwards it fell and appeared 
to remain approximately constant over the winter months until April 1949. Thereafter 
there was a rise in the rate until July, and between September and November 1949 it was 
again of very much the same order as in the previous year. Although no estimate of the 
death-rate can be made for the period subsequent to November 1949, the rate must have 
increased very greatly during the second winter, since the Microtus population was 
evidently reduced to very small numbers by April 1950. 

The relative constancy of the death-rate during the winter of 1948-9 is of some interest, 
since the population is of a more homogeneous age constitution at this season of the year 
than at any other, and neither sex is in breeding condition. In order to see whether there 
was any difference between the sexes during this time, we also estimated the 
rate between October 1948 and April 1949 by the methods of estimation w 
previously given (I, §5B and Appendix 1). The equations given there 
ofa constant survival factor P and an equal interval between the san 
easy to adapt them to the case of unequal intervals, and althou 


solved are somewhat more complicated, the actual method of soluti 
the same. 


average death- 
hich have been 
are expressed in terms 
nplings. It is, however, 
gh the equations to be 
on remains, in principle, 


In order to estimate the constant survival fa 
October 1948 to April 1949, we make use of the number of animals released at and between 
these dates, and of the recaptures of marked animals in the samples taken between Novem- 
ber 1948 and May 1949 inclusive, due allowance being made for any accidental losses or 
removals (I, Appendix 2), and only adult animals being considered in May. The following 
were the estimates of the constant survival factor P per unit of 28 days and the values of 


m. No correction has been made of the standard errors of 
finite nature of population sampled: 


ctor P per unit of time over the period 


these estimates to allow for the 


| Survival factor | 


| Death-rate 
ts 


| hts 
dé | 0-725 + 0-0288 | 0322 + 0. 
CM 0-867 + 0-0289 0:143 + 0-088 
Combined sexes | 0-817 + 0-0182 0-202 + 0-022 


Evidently there was quite a marked difference in the average death-rates of the two 
sexes, The difference between them is 0-179 + 0-052, which would be reckoned as significant 
quite apart from the fact that we are probably overestimating the individ ү у "s 
The average death-rate for the combined sexes is р = 0:202, while + ae, vitet 
weighting the three individual d е о average obtained by 


/ values in the i 
March by the reciprocal of their variances is Л im eG 
The death-rate in the populat: ; 4 


ion as a whole і 
of investigating the changing mortality experi 
owing to the intermittence of its breeding seas 
and complicated changes in the аре constituti 
year. Thus, from the figures given earlier, it i; 


, 


Ж Must suffer a series of Somewhat violent 
4 a of the population during the course of the 
Possible to decide whether the increase in 
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the death-rate which occurred in the summer months of 1949 was due to an increase іп the 
rate at which the overwintering adults were dying off, or whether it was due to a high mor- 
tality among young born during the early part of the breeding season. Indeed, both of 
those factors might have been operating, though in different degrees. A more satisfactory 
way of approaching the problem would be to construct life-tables for each batch of newly 
ringed animals which were released into the population and, if the data permitted, to 
subdivide each batch into males and females, and also into young and old. A detailed 
subdivision of this nature would, however, require much larger figures than those in the 
present series. But, as an example of the methods of estimating these life-tables, we shall 
compare the mortality experience of the individuals released for the first time in September, 
October and November 1948 with that of the releases in March, April and May 1949, 
Considering only adult animals in this last month. These animals which were trapped for 
the first time in the spring of 1949 must have been alive and of trappable age, certainly in 
October 1948, and probably also in September. They were, however, not caught then, and 
in view of the phenomenon of non-random sampling during the winter months to which we 
have already referred, it was of interest to see whether their expectation of further life, 
Once they were caught, differed appreciably from that of the survivors of the groups which 
entered the traps during the previous autumn. 

The various methods of estimating the number of survivors at successive intervals of 
time from a known number of individuals released at some given date have already been 
described (II, $2). Иа, individuals form the group released at a date d, then any recaptures 
of this group in the samples taken from the general population can be recognized by the 
presence of the mark d. These recaptures can be tabulated in various ways, and in the 
present series of data we assembled them in the form of the frequencies Mas, the number 
recaptured at time ¢ which were last captured at time 2 (d <w < t). We then obtained esti- 
mates of the number of survivors (N,) of the original group by the approximate methods 
described in II, 520. There is, however, one further complication in the present data, and 
that is the occurrence of accidental deaths and removals from the population at certain 
trappings. The simplest method of dealing with these is best illustrated by an actual 
example. у 

In March 1949, 76 Microtus (3788, 3999) were marked for the first time and released. 
Of these, one individual was found dead in a trap in April, and five were trapped in June 
opulation. The life histories of these six animals out of the original 
76 are therefore known, and we can immediately write down the corresponding life-table 
(6) figures for them. Thus, six were alive at the time of the trapping in April, five in May 
and five in June, and none after that date. The remaining records for this group are repre- 
Sented by the following table (see p. 154) of та values, in which we arbitrarily label the 
trappings from April onwards as being taken at t = 0,1,2, MM | 

It will be seen that, for instance, out of the remaining 70 animals released in March 
(having eliminated the six accidental deaths and removals), seven were caught in June, of 
Which two were last caught in May and four in April, the remaining individual in the sample 

aving escaped capture since March. The last member of the group was trapped in 

Ovember, and hence the last estimate which can be made of the number of survivors is 
that for September. Since no members of the group were caught in April 1950, the number 
of survivors in November is indeterminate; but we do know that at least one individual was 
alive then, and we shall adopt the convention both in this, and in all other such tables, that 


and removed from the p 


ж = 0, 1,2, ..., T — 1 from such a table has already been illustrated on a numerical example 
т л 
| | | 
* i Apr. | May | June | July | Sept. Nov. | е. ou ~ 
Nes 0 1 | 3 3 4 | 5 | EE, | E; | SR] 1 | | Ne 
x t zl | 
— | _ = 
Apr. 0 = 6 4 = | = — 1586 | 30 35 46-3 6 57 
May 1 -| = 2 1 || = — 5:25 11 14 26-4 5 31 
June 2 - - == Е == || = 5:25 т 1 9-0 5 14: 
July 3 — — = = 2 1 3:00 4 | 3 40 0 Е. 
Sept. 4 | о о Ж 1 40 | 0 4 
Nov. 5 — = == = = == ER + ee 1 | 0 1 
[| ба 
Total caught and 30 11 7 4 2 1 
| released (R,) 
2, — |154773| 1-75 | 1-75 | 1-00 | 1-00 
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the survivorship curve ends at the last date on which any member of the original group was 
recaptured. The method of computing the estimates of the number of survivors at times 


in an earlier paper, the first step being to build up the successive дапа XI, values, starting 
at the bottom right-hand corner. (For a definition of these symbols, and an explanation of 
the way in which they arise, reference should be made to II, $25.) Then 


~ T T 
№, = вд У R+ ЛІ У kaut JA 


t=z4+1 
Thus, from the table above, we have, as a first step, 


Х = 30 x 26/16-86 = 46-3, 


and since from the survivorship curve of the six accident: 


m al deaths and removals, I, = 6, we 
finally have N, = 52-3 as an estimate of the number of ae 


survivors out of the 76 released in 


re à E pping in April. Havi i rie 
of W,, ending with the number trapped at the у cow uale aun 


2 ‚ after allowing for the 
4. : ace at each ordinate y, In J «ample, 
individuals were estimated to have been a] ve at the time of the ber ap aan e 
сша since 4-0 were estimated to have been 
ха, i 
Then, the values of the life-table function follow fend : COP Ro e Е 
EOF yl D. 
4 mber of gro "in 
spem iia and observed Zt, and УВ, in each ec in ты ie dade i 
es for the original data, which are not presented h 8, оғ example, from the separate 
took place in March 1949: USOS RAUM а ‘rapping un 


» 
d 
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| 
Originally | , 
released in ZB, E 
1948: Sept. 19 1 
Oct. 10 4 
Nov. 13 0 
Total 42 5 


From which we have, from the ‘total’ line, 
х 20x43 


А = 59.91 


+5]= 43-7, 


as an estimate of the total number of these releases surviving in March. Proceeding in this 
way, we finally obtained the following life-tables (1,) and expectations of further life, in 
weeks, of individuals alive at the given dates (ez): 


Individuals marked for the first time and released in 
Unmarked 
Month Sept., Oct. and Nov. Mar., Apr. and May animals 
captured 
(и) 
l, е (weeks) 1. e, (weeks) 
1948: Oct. 1-0000 12-9 = ба "m 
Nov. 0:4793 17-6 = = 47 
1949: Mar. 0:2231 12-0 1-0000 9-9 78 
Apr. 0:2001 8-3 0-6882 8-5 68 
Мау 0-1582 6-0 0-4781 7.3 37 
June 0:0639 = 0-2081 = оч 
July 0:0372 — 0-1005 = = 
Sept. 0:0149 = 0:1225 = => 
Nov. 0-0074 — 0-0184 = БЕ 


available that the expectation of further life of the over- 
е caught for the first time in March, April and May differed 
Very little from that of the survivors of those originally captured in September, October and 
November. This being so, we might assume that their previous mortality experience in the 
Unmarked state was probably much the same as in the case of the autumn releases. There is, 
ОЁ course, no way of knowing whether or not this was true. ТЕ the sampling of marked and 
unmarked Microtus had been satisfactory during the winter months, we should have had 
по hesitation in making this assumption; indeed, it is implicit in all applications of the 
Sàpture-recapture method. But, in the present example, we cannot be certain that a 
difference in the behaviour of these two classes was not associated with a difference in 
Mortality experience. If we assume, however, that there was no difference in the mortality 
tates of marked and unmarked animals, we may obtain a rough estimate of the size of the 


16 is evident from the figures 
Wintering individuals which wer 
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Microtus population on the area in October 1948 by applying the /, figures for the MES 
releases to the number of unmarked adult animals which were captured up to May 1949. 
These figures are given in the last column of the table, and by dividing each of them by the 
appropriate І, entry and summing, we obtain a total of just over 1000 animals alive in 
October. If our 1, figures were exact, which they are not, this would be an underestimate 
of the true figure, since a number of overwintering adults were still being captured un marked 
in June, and possibly also in July, though we have been unable to separate these individuals 
with any certainty from the young. Moreover, in the population at October 1948, there 
were also the survivors of those marked at the trappings previous to this date, which we 
estimated in Table 3 to be around 75 individuals. Such estimates as we are indulging in here 
can only be approximate, but, so far as they go, they suggest that the total Microtus popula- 
tion in October 1948 was of the order of 1000 individuals, or a population density of around 
250 per acre. Comparing the former figure with the estimates of N, in Table 3, it will be seen 
how greatly we would have underestimated the probable Microtus population on the area 
in the autumn of 1948 if we had assumed without any further investigation that the 
sampling of this species was satisfactory from the theoretical point of view. 


B. The Clethrionomys population 


The average range covered in its lifetime by an individual Clethrionomys was somewhat 
greater than it was in Microtus, and the trapping area was less well defined by ecological 
barriers. Nevertheless, most Clethrionomys were retrapped within perhaps 30 yd. of their 
original point of capture, and, as is shown below, the population was not recruited except 
at the times one might have expected it to increase by breeding. 

The general biology of this species appears to be very similar to that of Microtus. Thus, 
from the examination in the laboratory of dead Clethrionomys, trapped at the same time 
on other areas at Lake Vyrnwy, there was no evidence that any breeding was taking place 


during the autumn and ‘winter months; whereas pregnant and lactating females were 
common in the samples obtained during the summer. From October 


| У possibility of error through the loss of these marks 
we have therefore disregarded the September sample for this species. Thus, all those 
animals known to have been first m ‹ Б 


other cases. 


(i) Preliminary analysis 


= 


теа. 
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of this species are not very large, but appear to follow a fairly regular annual course, being 
high in July and October 1948, low in the winter months, and gradually increasing during 
the summer of 1949 to a peak in September and November. In April 1950, the number 
trapped was much the same as in the preceding April, a complete contrast to the results 
for the Microtus population (Table 2). 


Table 8. Distribution of Clethrionomys recaptures according to the interval 
since they were last captured (тл). (88 and 99 combined) 


| Month of capture (2) 


Month when 


last captured (x) је ] 1 | | 
June | July Oct Nov. | Mar. | Apr May | June | July | Sept. | Nov. | Apr 
~i Ж | = | —ÓÓ ------Іі- - 

has; June = | ї# 1 | E = | = = = EX: == — — 
July — - 11 g = = = = = = = E 
Oct. — — — Bi 1 6 2 = | = -- — — 
Nov. сз Mom = — 10 | — 2 — | 1 — = - 
1949: Mar. == — == = | — 15 1 T - - => = 
Арг. са > — == | = — 10! 1 | — = = == 
Мау = => == == — -- — | 5 4 = = e 
June = X кегі == || == || == Ed uos 6 e zd ET 

July SS Set apa tare Жы 10 | - 
Sept. = — — =e) Mere == — — — E 191 2 
Хоу. — — — — | = = = = = -- — 17 
Total marked (sj) — 17° 12 18 | 17 | ша | 215 1 11 10 19: 20 
Otalunmarked (му) | 341 | 26° | 4M 4 | | | 6 7 18: | 361 | 395 8 

~ | ж” айсы -| — |- 
Total catch (0,) за. | 45 | sg | 39 | 25 | 24 | 5 | ла | 39 | ав | 58 | 28 
Otal released (#,) 33 38 49 92 | 95 | 24 20 14 | 28 45 52 28 
| | 


Turning to the estimates of B, given in Table 9, it will beseen that for the period October- 
April, one estimate is negative, one positive, and one not very far from zero. If we consider 
в alive in October 1948, and subject to these 


an imaginary population of 100 individual 
estimated A, = (В+ В) and И values, then, according to the latter, 36-1 out of the original 


100 would be alive in April 1949, and from the successive A, the total population size would 


Table 9. Estimates of population parameters for Clethrionomys from data in Table 8 


A | Б 

Month t | Ne ^ ^ = 

Я ( 0 г = 0-620 == 
948 dum 1 | 517 2.458 0-627 1:831 
July 2" | nus 0:368 0-530 — 0-162 
Nd 3 | 40-8 | 0-973 0-722 0-251 
1949: Mar. | 4 | 39-7 1:000 0:943 0-057 
don 5 | 39-7 0-811 0-639 | 0:172 
May 6 32.2 1:195 0-667 0-528 
T 1 37.3 0-771 0-429 | 0-348 
шн 1 | 29-0 1.643 0-357 | 12986 
iud 9 | 46-0 1-911 0-626 | 1:285 

Nov. 10 | И => " in 
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be 35-8: in other words, on the average the change in numbers is accounted for by the 
operation of the death-rate. (This result is in marked contrast to that for the M icrotus 
population during the same period, in which only 25 out of an original 100 would be alive 
in April, whereas the total population would have increased to 114.) | 

These preliminary results for Clethrionomys ате promising from our present point of view, 
and it is unfortunate that no estimates of the dilution factor could be made for the second 
winter during the period November 1949 to April 1950. We may, however, calculate an 
upper limit to В, for this interval. For this purpose we will define a new quantity, 

U, = Pp! BN,- dy). 
"This parameter may have very little real meaning or importance in the case of the general 
population, since it merely represents the number of new entries ВАМ — а) alive at time t+ 1 
which would have been alive at time t if they had been subject to the estimated survival 
factor Р,. For instance, if the new entries consisted of births during the interval, this clearly 
would be an entirely artificial figure. Mathematically, however, it is the exact equivalent 
of the parameter defined as Z, in the case of the subpopulation of marked animals, and it 
may be estimated by a similar type of equation. Thus we have, as shown in the appendix, 
Ô, = = ШҮРЕГЕЙ J | 
Th 14154 

Now, no estimate can be made of Pry for the last interval in a chain of samples, but we 


can obtain №, and U,_,. Thus, from the data given in Tables 8 and 9, we have for Novem- 
ber 1949, 


Nyp_y—dp_, = 86-0-6= 80-0 and Oy, = 519 «18-35 E. | = 5.6. 
Hence, under the most favourable circumstances, if Py, = 1, we should estimate the dilu- 
tion factor between November 1949 and April 1950 as Ву = 0-07, a figure which is to all 
intents and purposes zero. 

As a check on this apparent absence of dilution during the winter months, we may now 
consider the recaptures of marked animals during October, November, March, April and 
May (adults only), when these are tabulated according to the interval of time since they 
were first marked, Since the number of individuals was in any case small, the data for the 


two sexes were combined, with the results given in Table 10. Calculating the expected 
numbers of marked and unmarked animals as before, we have: 


Month of trapping | Marked | 


Unmarked Total 
1948: Ost, 12 (12-79) 41 (40-21) 53 
мек; | 18 (15-11) 4 ( 6-89) 22 
‘Mas, | 17 (17-53) 8 (7-47) 25 
Apr. 21 (19-50) 3 (4-50) 24 
| y | 15 (18-06) 6 (2-94) 21 
; 


н Owing to the small expectations in the un- 
hi ^ peetations in the 
ich for 3 D.F. is a perfectly reasonable value to obtain. 
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Table 10. Distribution of Clethrionomys recaptures October 1948—May 1949 according 
to the month they were first marked and released. (55 and 99 combined) 


Month of capture 
Month when Total 
first marked | | | | recaptures 
Oct. | Nov. Маг. | Apr. May 
“= р] | и i - 
June-July 12 5 4 6 5 | 35 
Oct. | — 10 13 11 8 | 41 
Nov. -- -- | 1 0 0 1 
Mar. == — | = | 4 1 5 
Apr. | | | 1 1 
No. unmarked 41! | 4 8 3 | 6 
=== р | m ханнын 
Total catch 5 | 2 25 24 21 | 
| | 


Thus, from this preliminary analysis, we may conclude that the sampling of the Clethri- 
onomys population was satisfactory, and that there was no evidence of any pseudo-dilution 
during the non-breeding season as in the case of Microtus. It is perhaps worth adding that 
we also carried out an analysis of these data in which we assumed that no September marks 


had been lost, and reached precisely the same conclusions. 


(8) Data for the marked individuals 

It is useful to have a further check on the applicability of the methods of estimation to 
these particular data by estimating the successive values of Z, from the re-recaptures of the 
marked population. The observed data and resulting estimates are given in Tables 11 and 12, 


s re-recaptures according to the interval since they 


Table 11. Distribution of Clethrionomy: 
ked population only ; 88 and 99 combined) 


were last recaptured (т). (Мат 


нс. 
Month of capture (2) 
1 Month when 
captured (x) July | Oct. | Nov. | Mar Apr. | May | June | July | Sept. | Nov. | Арг. 
— == 
1948: July = ЕК mU = ee сен ee ee 
Oct. - — is V. = E == | 2 = — — 
19. Nov. = = = 11 1 1 = - = = 
49: Mar. — = = = = 9: 1 
Арг. = кер = = кер = 4 3 — -- -- 
Мау -- == TES = 4 — - = 
June = == — — 6 — H 
July | 3 == 
Sept. — — | 7 
Nov. = 
te x | ist | 6 8 6 3 8 
T = 5 4 9 13 
uy E 7 | aM 8 8| 2 1 8 EC OE 
"uM Re Жын ND EE 
| | 
Total eat, | 17 21 М Y т m | 30 
ch (C 17 12 18 | es j P 
otal released [m 17 | 12 | ie | ae | я 1% ý S pug | 7 
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and it will be seen from the latter that between October 1948 and November 1949 a total 
of 151 bank voles were marked for the first time and released, whereas the estimated total 
is 150-8. The individual 2 follow the general trend of the true values very closely, con- 
sidering the small size of the samples on which they are based. In regard to the standard 
errors of these estimates, the same procedure was followed as in the case of the Microtus 
data. The sum of the available yy, is 96, and the sum of the corresponding s, = 67. Hence 
Л- 0:698, and the variances of Z, were then multiplied by the factor 0-302 to give the 


Table 12. Estimates, from the data in Table 11, of the number of Clethrionomys which 
were marked for the first time and released at each trapping (21) 


^ > | True по. Deviation | Standard 

Month ve 2, | released error of 2, 
1948: Oct. 5-00 42-0 | 37 + 5:0 + 13:3 
Nov. 4-00 16-0 | 4 + 12:0 + 61 
1949: Mar. 12-09 | 96 8 + 16 + 42 
Apr 22-33 —11 | 3 — 41 + 42 

May 20-00 2-4 6 = 6:6 | + 24 
June 13-00 | 41 | 7 - 29 + 94 
July 8-00 7:8 17 — 9-7 + 28 
Sept. 6-00 58-3 | 35 +18:3 | + 13:5 
Nov. 557 17-2 | 34 -168 +141 

Total 150-8 | 151 


n of Table 12. Comparing these with the devia- 
, it seems that on the whole the latter are of much 

ampling. The chief exceptions are the estimates 
for November 1948 and J uly 1949, for which the deviations are roughly twice and three 
times the respective standard errors. An unknown margin of error in the latter must, 
however, be allowed for, owin: 


g to the approximate nature of the correct; loyed. 
Overall, the sum of the s Я € correction employ 


MAY m quares of the deviations from the true values is 921, while 

52) 258. no evidence of any marked dis- 
m of estimating an unknown para- 
ble 11, we should not have been led 


her as to the magnitude of this parameter or the changes it 


underwent during the period of Observation. 


(ili) Estimation of tota] numbers 


The results of all the tests which we have b 
ын. мү: ыз cee е been able to apply to these data suggest that the 


number of individuals in the population. у; been described for estimating the total 
' 7 
% 


tables for Microtus. 
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Since we are here dealing with a relatively long chain of samples taken over a period of 
very nearly two years, during which time both the total numbers and the death-rate may 
have been varying quite appreciably, the appropriate methods of estimation are in principle 
similar to those which were briefly discussed in II, §4. Thus, we may take each group of 
6, animals which were caught unmarked and released marked at time x (x = 0,1, 2, 3,...) 
and estimate the number of these surviving at time £ (t = 2+1,2+2,2+58, ...) by one or 
another of the methods given in П, 52. Then, if Gy is the estimated number which were 
surviving at time t, with variance V(G,,), the total number of marked individuals in the 
population at time ¢ is clearly 


1—1 
= бш, 
т-0 
і-і 
апа Vi) = У У (ба), 


since the separate groups of animals are independent. We may then calculate the total 
number of individuals in the population by means of the maximum-likelihood estimate 


Ñ= у, 6-1,2,3,..,Т-1); 
or, preferably perhaps, by the adjusted estimate 
й, = Уа Viet D), 


| a © w gy QV) 
with variance E ИМ) = М ты у | | 


In order to carry out satisfactorily all the various steps in this,computation, however, it 
would be necessary to have a very much larger body of data than in the present example. 
The number of Clethrionomys which were ringed for the first time and released at each 
sampling was small in a number of cases, as may be seen from the unmarked (uj) figures 
given in Table 8. We therefore obtained a pooled estimate of the number of marked animals 
alive in the population in a similar way to that described for the construction of the life- 
Each group of releases was considered separately, and having first of all found the life- 

viduals in the group which were accidentally killed, the re- 


table //, values for any indi : у ; 
recaptures of the remaining individuals were tabulated according to the interval of time 


T 
since they were last recaptured. The values of ~ Ка, В, and XR were then computed from 
z 


these entries, as illustrated above in $ A (iii), by the methods given in IT, $25. For a par- 
tieular month, all the available values of these three quantities in the separate tables were 
then pooled ii well as any l, figures. Then, an estimate of the total number of marked 


animals in the population at time f is given by 
T T , Ја 
J= sns X В+ )/(s X kat) +80, 
2+1 t=2+1 


the variance of this estimate (II, § 26) being 


~ SER,—Srky — 
Vi) = Vi (SER, + 1) (928+ 2) Ы 
a L s 
D 4 тї 


Biometrika 40 
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where in both expressions the symbol S is used to indicate the pooled values. The estimates 
of Хала V(X) then follow from the equations given above. The following results were 
obtained: 


Month SUK, SR, | SER, | Sl pits 8, с, Nis 

1948: July 9-17 14 12 3 20-9 + 1-5 17 43 511+ 6-9 
Oct. 16-67 12 27 0 19-0 + 1-6 12 53 78-9 + 12-9 
Nov. 30-36 18 60 1 36-0 + 2-6 18 23 43-64 41 

1949: Mar. 24-51 17 44 1 31-04 2-3 17 25 44-84 49 
Apr. 19:10 | 20 34 1 35-8 + 3-0 21 24 40-74 29 
May 1400 | 13 21 2 211€ 1-6 15 21 29-04 3:3 
June 7:60 7 17 2 16-7 + 2-1 7 14 31:34 6:2 
July 8-00 11 8 0 11:0+0 11 29 27:5+ 36 
Sept. 4:00 10 6 0 140 + 1-8 10 46 59-8 + 12:0 
Nov. 7:00 18 10 1 25:8 + 2-6 19 58 76-1 + 11-4 

| 


One minor point is to be noted in regard to the estimates of y. At the trapping in July 
1949, we have yj, = 


11-0 + 0; in other words, it seems that the entire marked population at risk 
was captured. It is always possible, of course, that in the case of a small population and 
a high degree of sampling effort, this may happen, but if so we must have Ё, = C, when we 
use the maximum-likelihood estimate of N, and in using the adjusted estimate we will have 
a value of М, < C, as will be seen in the last column of the table, 

The standard errors of 7 and N, given in the table are adjusted to allow for the proportion 
of the population which on the average was being sampled. Thus, by summing the ў, and 
s, columns in the table, we have f, = 147/231 = 0-636, and 1 —f, = 0-364, this factor being 
then used to correct the variances. (It is interesting to compare the value of this factor with 
that used to correct the variances of the estimates of Z, given earlier in this section. There, 
1-Л = 0-302, this figure being based on the estimated y, and observed в, іп Tables 11 and 
12. Those yy, and s, are not, however, the same as the ones in the present table, since they 


represented the individuals in the subpopulation of marked animals Which were marked 
at least twice.) It will be seen from the estimates N, of total numbers that the trappable 
Clethrionomys population on the area 


was not very la; 
more than 100 individuals of both s 


exes at the ре, 
figures we might s 


and hence the death-rate, 


where w, is the true interval i у 
We also have en the particular sam 


A= Mual(M,—d), 
ring the interval by 
р; = log, Alw. 


plings to which P, refers. 
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This rate of increase p, between ¢ and #- 4, which we assume to remain constant over the 
interval ¢ to +, is composed of a positive quantity Ду, representing the rate of dilution, 
and д; the death-rate, so that //, д, = р. Having calculated p, and ду, we thus can obtain 
Ё, very simply by subtraction. The sampling errors of these estimates, however, are more 
difficult and we have been unable to solve the problems involved. The difficulty arises over 
the question as to the degree of correlation which may exist between the successive estimates 
of ү, and N, which are obtained from the data treated in this way. Any such correlation 
which exists is likely to be negative in sign, and, if we were to neglect this and merely write 
down the expressions for the variances of p, and А, according to large-scale sample theory, 
we run the risk of underestimating the true sampling variances of the resulting Й, and д. 
Leaving aside this question, however, the following values of p, В, and дү, expressed per unit 
of 28 days, were obtained for the Clethrionomys population: 


Dilution-rate Death-rate Rate of increase 
(0) (ш) (Pi) 

1948: June- — 0:305 —. 
Toly 0:475 0:283 0-192 
Oct.— — 0-067 0-294 — 0-361 
Nov.- 0-069 0-062 0-007 

949: Mar.- — 0-009 0-071 — 0-080 

= E 0:271 0-609 — 0-338 
May- 0-559 0-446 0-113 
June- 0-681 0-798 —0-117 
July- 1:005 0-462 0-543 
Sept.-Nov. 0-320 0-229 0-091 

| | 


The first point of interest to be noted is that the three dilution-rates between October and 
April are each essentially zero, а result which might have been expected from the pre- 
liminary analysis, but whichneverthelessserves asa useful check on our original approximate 
estimates of В, in Table 9, since we are now utilizing much more of the information contained 
in the data. After April the dilution-rate gradually increases during the breeding season 
of 1949, The death-rate in June and July appears to differ quite markedly in the two years, 
being about twice as great in 1949 as in 1948. It is interesting to find that the death-rate 
falls to a relatively low level during the non-breeding season between November 1948 and 
April 1949, in a similar way to that of Microtus. It then increases markedly after April, 
but the dilution-rates are at that time sufficient to keep the population relatively stationary 
in numbers until July. There is then a sharp rise in the rate of increase between July and | 
September, which is presumably due to an influx of the young born during the latter part 
of the breeding season, which have grown up to become part of the trappable population. 
The broad pattern of these results thus confirms our previous knowledge of the biology of. 
this species, and in addition we have obtained a numerical measure of the changes which 
Occurred in the various population parameters during the period of observation. 


4. DISCUSSION OF RESULTS 


In the first two papers of the present series we were concerned with the theory of estimating 
Population parameters from а series of samples, granted certain premises. We could also 


Show that under conditions when these assumptions were approximately true, as in the 
II-2 


% 


ef 
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case of a population of counters under our strict control, valid inferences could be drawn 
regarding the number of individuals and the death-rate in the population. In the present 
paper the chief emphasis has been upon testing the truth of the assumptions as they apply 
in nature. Unless such investigations can be undertaken, the parameters of natural popula- 
tions cannot be estimated with any confidence from data obtained by means of the capture- 
recapture method. 
_ The present field data were collected in a manner which avoided as far as possible the 
grosser errors, such as non-random marking and resampling, or interference with natural 
processes. No trapping technique, however, can sample a population in a manner that is 
independent of the behaviour of the individuals, and biased sampling may be unavoidable 
if there is no homogeneity in the response to the traps. Because this difficulty was en- 
countered, no parameters could be estimated for the Microtus population as a whole. How- 
ever, from several lines of statistical evidence it appeared that once the animals had been 
trapped they behaved thereafter with sufficient uniformity for valid inferences to be drawn 
about the marked population. Thus, although the theoretical methods of estimation 
developed in the previous papers cannot be applied in their entirety to the Microtus popula- 
tion, some important features in the latter may still be described. There is also much addi- 
tional information which we hope to use in a later paper. By contrast, the results for the 
Clethrionomys population provide no evidence by which to reject the — assumptions 
of the marking-recapture method of analysis. Accordingly, the death-rates, dilution-rates 


and total numbers may be given with some confidence. Even here, however, one reservation 
must be made. | 


Let us first of all review the evidence on which the estimates of the total Microtus popula- 


tion were rejected and those of Clethriono 
mys were accepted. isolate 
nature of its habitat that the Microtus pted. We knew from the isola 


if, in such a са; 14 obser- 
Сини Я se, the field о 
ght * der ы 8 more elaborate experiment? 
^ "А 5, 76 possible to enumerate the entire 
Жы қ 
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population of some species at certain times, so that a few of the estimated values could be 
compared with those observed. Perhaps a more practicable suggestion is to try sampling 
by methods which depend upon other properties than those which determine the com- 
position of the catch by the methods normally employed. These alternative methods might 
be too laborious for regular use and would then be used solely as a check. On the other hand, 
if they proved that the old methods were biased there would be no option but to make а - 
complete change in the field techniques. ЕР" 
Finally, it should be emphasized that the results of this analysis do not imply a universal 


` law for each of the two species. The data for some other Clethrionomys population might 


not prove во satisfactory as those presented here, and for the present, at any rate, it will be 
necessary to consider each particular case in detail before deciding that these methods of 
estimation are applicable. 


5. SUMMARY 


1. Two populations of small rodents, Microtus agrestis and Clethrionomys glareolus, 
living on the same area, were sampled by means of a live-trapping technique over a period of 
very nearly two years. The resulting data are analysed here in some detail in order to see 
whether any valid estimates of population parameters can be made by the theoretical 
methods described in two earlier papers. These methods are based on the assumptions that 
the sampling of the population is entirely at random and that all classes of marked and 
unmarked animals are caught with equal facility. The main purpose of the present paper 
is therefore to see whether or not these assumptions were true in the field. 

2. The sampling of the Microtus population as a whole proved unsatisfactory, since it 
appeared that during the winter months marked and unmarked animals had not the same 
chance of being trapped, members of the latter class evidently being less willing to enter the 
traps. Since this difference in behaviour was observed during the non-breeding season, 
when no new members should have been entering the population at risk of capture, we 
could not exclude the possibility that it was also occurring at other seasons of the year. In 
consequence, it was not possible to estimate the total numbers of Microtus on the area by 
methods which are based on an assumed random sampling of all classes in the population. 

3. Once trapped, however, the individuals in the Microtus population appeared to behave 
satisfactorily from the point of view of capture-recapture theory. It was therefore possible 
to use the data for these marked animals for the estimation of the death-rate, and for cal- 
culating the further expectation of life of animals marked for the first time and released in — 
different months. Such figures, however, which can be estimated by the methods illustrated 


in the text, can only refer, strictly speaking, to that part of the total population which had 


been marked. У 
4. The Clethrionomys population, on the other hand, behaved differently. There was no 


evidence of any differential sampling of the various classes of marked and unmarked | 
animals, as in the Microtus population. Consequently, we can apply to these data, with 
Some degree of confidence, the appropriate methods of estimating the various population 


Parameters. 


The expenses of this — № partly paid for by a grant from the Agricultural 


Research Council. 4 % 
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APPENDIX 


The method of obtaining approximate estimates of N, and P, from a long chain of samples grouped by 

Method B, which was described in II, § 56, was modified in the following way. Instead of using only the 

last two entries, m, ә, and 7, ,,, in each column of the original table, we construct à new table by 
1—2 

retaining each value of m, ,, and forming the sums У та, which we define as n, (t = 2,3,4,..., Т). 
т-0 

Including the unmarked animals, we have therefore at each sampling subsequent to # = 1 the three 

classes мү, т, and y 4, (n; 4-71, 4, = S 8 Ьа, = C,). Since у, is the total number of marked individuals 

in the population as a whole at time #, 


Vo = 0, } (1) 
Vea = PE y) 
where, allowing for the d, accidental losses or removals, у, = u,— d, = R,— з. Then we have the following 


expected number of individuals in the population as a whole, together with the observed numbers in 
the three classes, at the time the sample was taken at t+ 1: 


Na = Vea Ua 

Via — PB, Tta 
PB, Пал 
Ма Сан 


It is convenient to introduce a new parameter W, defined by 
Na =Р И, : 
and from (1) to rewrite this table of expected and observed numbers in the form 


РУ, (+ у] 


Up 
Рив.) Та 
PR, Tua 
P, W, Cu 
from which we may write down the log likelihood equation, after cancelling P, 
L = илов М, — (у y)]4- пал log (ү, — 8) — Се log W,. (2) 


i 1 B E " ^ ^ 
Hera ee г ini np eR mi estimates W, and Ws together with their variances and 
Ma aUbi DArsmelers iaa Since, however, we are concerned here not so much with yf, and Wy 28 
icy 0 e emnes i = € to them, it is simpler to proceed in another fashion. It will be 
PNE LO eod cos de. P of freedom for estimating these two parameters and that the maximum- 
аи Бае Vern ТЕ ependent. Under these conditions we can equate expected and observed 
у, ). Thus, xample, we have the pair of equations 


Me = Cu) 80) We 
Dh 44 = О, БАТ, 


in the present e 


from which Î, = ый 
y, = Mette - 
t Mitri +8, (t= 1,2,3,...,Т-1) m" 
and 
AR | У, = БОт a. 
ast equation may ђе rewritten with the help of (3), usi 
; using 
~~. А 8, = Њу, апа Tea My = 353 
orm = i 
у, = Gc y) €, a (£2 0,1,2 
ерни, + 1,2, „Т—1). (4) 
t= ај y) 4-0,1 
and since N, = Р, "1,2, „Т—2) ы 
t = Ре У, 1, we have, from (4) and (5) | 
А А | 
№, = ҮС, (t= 1,2,8,...‚т_ 1) 


үз 


m~ 
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The variances and covariances of these estimates, appropriate to large-scale sample theory, can be 
obtained by means of the type of formula given by Fisher (1950, $ 55). Thus, if Фф, and фу are two functions 
of the observations a; of which the expectations are ть, with Ха; = Em; = n, the total size of the sample, 


then (Ф) = Ry (20), (7) 


and a similar equation for Г(ф»), while the covariance is 


ЕТІСКЕ! 


In applying these formulae to the present type of problem, there are, however, two points to be borne in 
mind. In the first place we may have a parameter which is a function of the observations from more 
than one sample, in which case we must also sum over the samples. And, secondly, it is necessary to 
distinguish between observations which enter into the expectations as given values, and the actual 
observations themselves. Thus, in equation (3) it is true that В, and s, are observed values at time t, 
but they enter into this expression as part of the expectations at £-- 1, which are conditional on the 
results of previous sampling. Here we regard yy, as a function of the observations at t+ 1, and since С, 
does not occur explicitly, there is no term in (л) corresponding to n(29;/0n)* in (7). Thus from (7) we 
have, in terms of observed values, 


Vin) = 


2 2 2 
таВ тый Rinse 


(туға) (Matz)? (тугал) 


Formally, we now should substitute expected for observed values, and write 


Wilt y) (У: — 80 
Vi) = AA E ад 


а result which may be verified by the more lengthy process of forming the maximum -likelihood equations 
from (2) for у, and Wa and then inverting the corresponding peices под In actual practica, 
of course, we would calculate the numerical value of V(y/) by inserting the estimated W, and y, in this 
expression, and since W, is a parameter which was introduced merely for convenience (it is not necessary 
here to calculate its actual value), it is more practical to write 


Та 
7t.) = (у 2 Ulo 
Vin) = Gt м) EG 

Viv) Уа) 
(уд Vita 


б,— Уф 


There is one further parameter we wish to determine which arises in the analysis of the subpopulation 
of marked animals, namely, 2, the number of new additions to this population which were made at 
time 2, From the theoretical point of view we therefore consider a population consisting of a variable 
number М, of individuals at the time the sample of О, is withdrawn, and we make the rule that any 
h new members shall only take place at the actual time a sample was 
returned. Thus, if an unknown number of 2; new members enter at time t, when the В, survivors out of 
the original sample of О, are returned, the total number of individuals in the population at this moment 
is N, +Z,—d,. Then, since by definition no further new members enter during the interval 2 to t+ 1, the 
total number of individuals when the sample of О, is taken at ¢+ 1 is 


, 


In a similar way we have Ү(Р,) = ғ| 


dilution of this population wit 


Мы = РАМ,+®%—4). 


Now clearly (М,+ Z,—d) is the same parameter as we have defined earlier as W,, and hence from (4) 
2. t 
and (6) 


t 


" ^ 
G, Ни бы Ha, (8) 
t 


8а 


ж Сл — па С == = 
or alternatively Z= D аи 1] (=, и. 


8+ 
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The latter form is more convenient in practice if we merely require the estimates for a set of observed 
frequencies, but if in addition we also require the variances of these estimates, then the former is to be 
preferred. For, if we write p, = з С» а: = 1—p, we have, proceeding as before, 


А А 
2 1 РА? 2 2 
vÊ) € (==) | [(® та) Tua +в |} + (5) а. 
Ред 8+1 Pe Meter Tu 8 
All these results can be verified by considering the expected numbers in the population and the observed 
numbers in the samples taken at £ and #+ 1. Thus we have 


t 4-1 
| N,— y, щ P[W,— (+00) ““. 
й—РВ, т О) Tuy 
PaRa Tu PR, Tu taa 
N, C, PW, Char 


from which we can easily write down the maximum-likelihood equations for the simultaneous estimation 
of the four parameters which can be determined from these samples, namely, P, ,, у, №, and И’, and 
obtain the corresponding 4 x 4 information matrix. By inverting the latter, the variances and со- 
variances are obtained, and finally Z, апа V(Z,) from 


= W,— а, 
VW) = V(W,) + VS) – 2 соу (ТУ, Ñ). 


This procedure is, however, very much more lengthy and tedious than the one adopted here. 
There remains the question whether this maximum-likelihood estimate of Z, is biased. It will be noted 


from (8) that 2,18 essentially the difference between two estimates, each of which is of a type known to be 
positively biased (II, $ 1). If we could assume that the absolute amount of bias in each case was of much 


the same order, then we might regard Z, are being approximately unbiased. But it is doubtful as to 
how far this assumption would be true in general, and it is more likely that when 2,> 0 the bias of the 
positive term would tend to be greater than that of the ne, г 


h ative. Und i Z might 
have some degree óf positive bias, and in ; Aah peeing ON, ake! 


Ee place of (8) we might therefore prefer to use the adjusted 
= (ety) (Ca + 1) YO+ D d, 
S541 8+1 | 


* H ^ . 

Е the comparison of Z, with the known true values of this parameter played such an important part 

| sar зыш in ешш, it was therefore of interest to see whether the use of this adjusted 
а ауе made any difference to our conclusions. i T i 

obtained in the case of the data for the marked Mirror oe bie mis 


tus population: 
fx T 
True values 2, 7, A(Z,) A(Z) 
t 
Бері, z 48-2 44-8 — 16-8 | — 20.2 
Ері, 51-4 46-5 +10-4 
er. 45 38-9 37-5 - 61 ме 
= С 66-8 65-3 - 92 244 
i [n n 107-3 +468 144 
May 9-6 22.3 — 26-4 23-7 
oe 48 35-5 35-9 —19.5 E 
Sept. 65 24 67-4 +155 1184 
Nov. 108 78-8 78-4 13 tia 
88-9 81-3 т oa а 
= — 26-7 
Total 613 609-4 586-0 i 
Lj 
қ 1 


A 


т», 


ao owe 
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It will be seen that, with one exception, Z«2, and that we do not obtain quite such a close corre- 
spondence between the estimated and known total number of Microtus released over the whole period. 
Taking the individual Z, however, it seems that on the average they are, if anything, in slightly better 
agreement with the true values than the Z,. Thus, the sum of the squares of the deviations ZA*(Z;) = 4196, 
whereas хАЧ2) = 4351. These differences аге, however, slight. We concluded, therefore, that if the 
maximum-likelihood estimate 2, was biased, the degree of this bias was probably small compared with 
the sampling errors involved, at any rate in this series of data. 
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ON THE UTILIZATION OF MARKED SPECIMENS IN ESTIMATING 
POPULATIONS OF FLYING INSECTS 


By C. C. CRAIG 
University of Michigan 


1. INTRODUCTION 


Professor William Hovanitz called my attention to the following problem: An observer 
catches butterflies, marks them, and immediately releases them. It is assumed that g 
butterfly, no matter how many times it has been caught before, has the same susceptibility 
to capture as any other butterfly in the population which is supposed stable while the 
captures are being made. Records are kept of f,, the frequency of cases in which the same 
butterfly is caught z times, x = 1, 2, ..., until a total of s captures of r different butterflies 
have been made. (Xf, =r; Ха), = в.) The number, fy, of butterflies which escape is not 
observed; the problem is to estimate from the values of f, the total population n of butterflies 
on the area assumed well defined. 

The estimation of biological populations by means of capture-rec 


means а new problem, though papers dealing with it from the m 
point of view are largely quite recent. 


apture data is by kar 
athematical-statistica 


» twice, ete., with 
this point at the end of the paper. 
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2. THE APPLICATION OF A TRUNCATED POISSON DISTRIBUTION 


If we suppose that the total period during which captures are made is composed of a large 
number of short intervals sufficient for the capture of a butterfly and that each butterfly is 
equally subject to being netted in each such interval, it appears that one may consider that 


Еу) = ne (w= 0,1,2,...), 


If so, the likelihood of the observed sample is 
ет" Asn! 


(11)5 (27s... (в) fV fe! ee’ (1) 


L= 


In which я and A are parameters. 
Before proceeding to maximum-likelihood estimates, we may note that two moment 


estimates are immediately available. 


Method 1. We have E(r) = п(1—е7%), 
(8) = na. 
Equating the observed 7 and s to their expected values, we estimate A from 
r l—-e^ 
DM. (2) 


Using this value of A, we estimate % from 
й = s[A. (3) 
In Practice we can estimate n directly by eliminating A between (2) and (3), getting 
logz —log(n—7) = s/n, 
Which 15 readily solved for the integer % which most nearly satisfies it by the use of a good 


table of natural logarithms. : | 
In correspondence with Prof. Hovanitz, Prof. Sewall Wright gave the above estimate 


and also suggested the use of the method of maximum likelihood. 
Method 2. Tt is still simpler to use the first and second power sums 
(==) and = Dt Bn 
For а Poisson distribution, E(s[n) = Е(вујт) — LE (sh), 
or [E(s)P. = пјЕ (sz) — Ё(в)]. 
Thus using s and s; for their expected values we have the estimate, 
ñ = 5°](85— 8). 
Though this is obviously subject to greater sampling error than the first estimate and the 
Others to follow, it serves as a quickly obtained rough value which also can be used as a first 


Tial value in solving the other estimation equations. | 
Method 3. Let us consider the truncated Poisson distribution for which we have observed 


"quencies as a complete distribution function, i.e. we write 


e^ ЛЕ 
EM MSN = 1,2,...). 
= тела h ) 
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Now B £) = E@) = E cs (4) 


_ |і 
and equating this to s/r, we estimate A from 


r[s = (1 —e-*)[A, 
just as in method 1. 
But now the likelihood of the sample is 


Asr! 
m (e 1)" 005 (207... A fal? 
and log L = slog A— log (e^ — 1) + terms free of A. 
Then dloggL s T 


@ x gm 


is equation (4) again. Thus in this case the estimates obtained by the moments principle 
and by maximum likelihood agree. We next estimate n from the relation 

E(r) = n(1— 67), 
asin method 1. That is, methods 1 and 3 are mechanically the same, but under the argument 
for method 3 the estimate for A is that given by maximum likelihood. 


Method 4. We return to the likelihood function (1) in which both n and A are regarded as 
parameters. Now 


log = – пл + slog А 4-log [n] + terms free of A and n 


dlog L $  OlogL т= 1 

and =—#+- — = 

a tb. р 
Equating these to zero and eliminating A, the estimation equation for n is 


g #23 


п xceon—k 
Now the only solutions of this equation of interest are those for which n>r. It is not 
difficult to show that such solutions exist if 


s[r<l+ht...41/r. 


But for any fixed population, "< т, while s, the total number of butterflies caught, may be 
increased without limit (ignoring the wear and tear on the butterflies). Thus for increasing 
sample size the likelihood equation fails to have a meaningful solution, and the usual 
theorems on the asymptotic behaviour of likelihood estimates are not available. Of course 


Ea а a т has n for its stochastic limit. For all the sixteen field experiments for which I have 
Е "a pes s mr dn exist and can be found using tables of sums of reciprocals for 28 
= У using the estimate given by method 2. For larger n’s a good approximation to 
~ 1/(n.— k) is given by 


Re logn—log(n—r--1)-M1J(n — 41) 1/n], 
and the estimation relation on neglecting terms of O(n —74- 1)? is 
logn—log (в + + ит 
As shown in the table at the 
other methods. 


)+ 1/n] = s/n. 
agree quite well with those given by the 


, 


end, these estimates 


| 
| 
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3. THE APPLICATION OF STEVENS'S DISTRIBUTION FUNCTION FOR GROUPS 


А second mathematical model may be arrived at by deriving an expression for the expected 
number of times a butterfly will be caught if a total of s captures is made, if it is assumed 
that each butterfly, no matter how many times it has been netted before, is equally liable 
to capture and by considering after each capture the probability that the next will be of 
à butterfly that has been caught before. But this was seen to be equivalent to a problem 
Solved by W. L. Stevens (1937) as follows: Let each of a set of objects on being drawn be 
equally likely to be assigned to any one of nclasses. After s objects have been drawn and 
80 assigned to classes at random, what is the probability that exactly r classes will be 
Occupied? Stevens's solution (which I also obtained before consulting his paper) is 
mn? NOS по 5 


= — 05 (5) 


Ша == a 
TS ns r! p 


in which 0? is a Stirling's number of the second kind (see Jordan, 1947, pp. 171-2). (Actually 
Stevens’s problem was not original with him either; see Jordan, 1947, pp. 177-9.) 

With this distribution law » is the only parameter to be estimated. Again this can be 
found from a sample with a given r and s either by moments or by maximum likelihood. 

Method 5. Stevens found that 

E(r) = n1 — (1- ут), 
ànd thus from a single sample we have the estimation relation 
1—r[n = (1- ут). 
This can be solved by trial and error by means of a good table of logarithms fairly con- 
Veniently by rewriting it 
log (n—7) = slog (n — 1) — (s — 1)logn. (6) 
Method 6. The likelihood of the sample is given by (5). We have 
log L = logu, = log п) — 8100 т + terms free of n, 


бї 1 
dlogL _ X T ea (7) 


аа dn 15т-Б n 


This equated to zero gives the same estimation relation as in method 4 though arrived at 
from a different model. That this is to be expected can be verified by observing that if in (1) 
Ais replaced by s/n, then to maximize (1) is to maximize n/n’. | 

Могеоуег, it is readily seen that numerically methods 5 and 6 are almost equivalent. In 
Practice we take in method 6 the value of n for which the right member of (7) is most 
Nearly zero, Suppose that instead we seek the largest value of n for which 


(n—1)9 т” 
(а-1 “т 


T 1} 
But this inequality reduces to I-2s (а = 3 " 


However, it is obvious that in this case, too, there are the same difficulties in regard to the 


атре sample behaviour of this estimator as in method 4. ‚ 


^ 
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Let us now derive the large sample variances of the other estimates which are three in 
number, since methods 1 and 3 are equivalent. к 

For method 1 we proceed as follows: With А replaced by its estimate s/n, the estimation 
relation is r= (1-6-9), 
дп ôr — e^ bs 


and from this n па ехо део 


(8) 
Of course ©, if Ӯ is the estimate, will be proportional to n, and we will find Tin which is 
given to O(1/n) by finding the expected value of the square of the right member of (8). We 
have then to O(1/n) 

е—^(1—е—^) — де“ A eà Де“ РА 1 


оў = === 
Qe n(l—e-^—AÀe- е^ —l—2A)' 


(9) 
For method 2 proceeding in the same way, starting with n = 8%/(8, — s), we have 


dn _ 2(s,— s) sôs — s*(8s, — дв) | 


n n(Sa— 8)? 


Taking the expected value of the square of the right member and the values of moments 


about zero of a variable obeying the Poisson distribution law, we get the simple result to 
O(1/n) in this case, 2 


Thm = ә (10) 


As is to be expected, this variance is larger than for method 1. 
For method 5 the result is a bit more cumbersome. We first get 


0n —— (n—1)ér 


nm r(s+n—1)—ns’ 
in which on the right r satisfies the relation 


SEE 
и 


апа it is easy to find Efr(r—1)] = a [n5 2(n— 1 (n— 25] 
PS Я 


1\s 8 
= n(1-2) +ntn— (1-2) ~ne(1— а) 
N, n) ` 
This 


ü 
a 5, ты. беш be so small for even moderately large n’s even though the last two terms 
5 appears necessary to calculate it just as it stands. We have to order O(1/ ”) 


Next we must find 02, We have 


Using these 


— (-1)6 
Tin паат) (11) 


s ш d ч Р га ces 
with a pair of numerical exam lne für estimates of т and their varia? 
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Example 1 


AT _ 
0 
1 66 8-79, 5=78 
2 3 
602r 


By method 1 (or 3) on solving logn —log (п — 69) = 72/n we get n = 840. Our estimate 
of A is then 72/840, from which 


2 1 — ОТБ 
dic 7 лей 9 
and бат = 0:561. 
722 
By method 2, й= 75 = 96% 
235 [9т\ї 
Can = (ЕУ = (=) ‚ро: 


By method 4, on solving 


cd. A 7% 
logn — Јов (n ву (ate E 


We get n = 828. By method 5 (or 6) we also get n = 828. For method 5 we have 
бат = 0'55. 
All of these values of the standard deviation of the proportional error of the estimate of 


% are high. This illustrates the fact that for an n of this magnitude a catch of only about 
% of the population was not an adequate sample for the estimation of the total. 


Example 2. Let us see how matters turn out in another example in which the estimated 
Population is about the same but the total catch is some six times greater. 


Data Estimates and standard errors 

| f Method й | бат 

me 1 and 3 856 | 0.0871 

УБ 2 901 0-0976 

EE : р 

3 3 5 858 | 0.0867 
— 6 853 | — 
34127 


8- 435, s= 901 


Тһе following table is a summary of the results for sixteen experimental catches. 
Е inally, Ireturn to the question mentioned earlier, of testing the suitability of the models 
Used to the experimental conditions. For the first model using the estimates of n and 
obtained, it is easy to calculate the predicted frequencies for 2 = 0,1,2,.... For the two 
&Xamples given in detail using the estimates found by methods 1 or 3, the frequencies 
“aleulated for 2 = 0,1, 2, ... are: Example 1: 771-0, 66-1, 2-8, 0-1; and Example 2: 515-1, 
61.7, 66-5, 11-2 jud. 01. АХ? test is superfluous for the first; for the second the significance 
level is above 0-5. I hue similarly tested the remaining fourteen examples for which I have 
ata, The significance level for x? was above 0-5 in every case but one, in which it was 
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approximately 0-3. The second model does not lend itself to a similar check. For that I = 
have to offer the fact that the estimates obtained by its use agree quite well with those foun 
from the first model. 


| 
Observed values | Estimates of n by method | Values of Fijn for method 
T 8 5 | land3 2 4,50r6 | land3 2 5 
69 72 78 840 864 828 0:56 0-58 0-55 
93 108 140 352 364 348 93 -25 р 
159 187 249 560 564 557 17 18 d 
144 161 197 708 720 703 -22 -24 E 
63 76 108 196 180 193 0-94 0:25 0:24 
56 66 92 195 168 192 28 28 28 
48 74 188 79 86 78 14 18 e 
341 435 645 856 901 853 087 098 08 
276 330 450 895 908 892 0-11 0-13 0:12 
222 249 303 1063 1148 1059 18 19 Um 
154 225 385 277 316 275 090 112 09 
ы 148 180 256 444 426 442 15 16 15 
63 91 155 116 129 114 0-15 0-18 014 
60 98 216 91 81 90 11 13 A 
71 118 258 105 99 104 10 12 a 
46 89 211 59 65 58 092 13 08 
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THE TOTAL SIZE OF A GENERAL STOCHASTIC EPIDEMIC 


By NORMAN T. J. BAILEY 
Nuffield Lodge, Regent's Park, London 


1. INTRODUCTION 


The early work on the mathematical theory of epidemics (e.g. Ross,* 1916 and later; 
Brownlee, 1918; Kermack & McKendrick, 1927 and later; Soper, 1929) was invariably of 
n ‘deterministic’ nature, and assumed that for given numbers of susceptible and infectious 
Individuals, and given attack and removal rates, a certain definite number of fresh cases 
Would occur in any specified time. However, it is widely realized that an appreciable 
element of chance enters into the conditions under which new infections or removals take 
Place. The probability approach was fundamental to Greenwood’s (1931, 1946) use of chain 
binomials in discussing the distribution of multiple cases of disease in households. Recent 
discussions of these problems (e.g. Bartlett, 1946, 1949; Bailey, 1950) have therefore turned 
to ‘stochastic’? models. In these we have, for any given instant of time, probability dis- 
tributions for the total numbers of susceptible and infected individuals replacing the single 
point-values of the deterministic treatments. Stochastic models have a special importance 
in this context due to the fact that for epidemic processes stochastic means are not the same 
as the corresponding deterministic values. Although for large homogeneously mixing groups 
deterministic methods might be fairly adequate, it seems likely that in practice epidemics 
Actually occur in several relatively small groups of friends and acquaintances, the epidemio- 
logical returns for an administrative unit being compounded of many such comparatively 
distinct processes. Moreover, when we are considering the distribution of cases of a disease 
in a household the size of the group is always so small as to demand a stochastic treatment. 
| In my 1950 paper I discussed a simple stochastic epidemic where none of the infected 
individuals was removed from circulation by death, recovery or isolation. This might well 
apply to some of the milder infections of the upper respiratory tract, and can also be used 
Approximately to represent epidemics for which the time taken for removal from circulation 
18 long compared with the time usually required for the epidemic to be completed. The 
Present paper considers the more general problem of allowing for both infection and 
Temoval. The analytical difficulties present in the treatment of the simple epidemic appear 
here in a more acute form, though it has proved possible to compute the frequency dis- 
tribution of the total size of the epidemic for moderate group size given the ratio of removal 
O infection rate. The results obtained may be compared with those described by Kermack 
MeKendrick (1927) in the deterministie case. No obvious analogue to the threshold 
богет can be discerned in the stochastic models. An important application of these 
Tesults is to the problem of the distribution of multiple cases of disease in a household, and 
à method is given for obtaining maximum-likelihood estimates of the ratio of the removal 


to Infection rate. 


ж Although Ross started with the idea of probability his mathematical theory is essentially 


deterministic. 


ї2 
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2. DETERMINISTIC TREATMENT 


We must first glance briefly at the results obtained in the deterministic case. The following 
treatment, with constant infection and removal rates, is substantially that given by Kermack 
& McKendrick (1927), though with some slight alterations to their notation. 

Consider a homogeneously mixing community of n individuals, of whom at time ¢ there 


are v susceptibles, у infectious cases in circulation and z individuals who are isolated, dead, 
or recovered and immune. Thus we have 


уча = п. 


Now suppose that there is a constant infection rate В and a constant removal rate у, so that 
the number of new infections in time dt is fy dt and the number of removals from circulation 
is yydt. We can choose our time scale so that t is replaced by ft. Then it is easy to see that 
the course of the epidemic is represented by the differential equations 


ат 

di^ v 

тр = PY (1) 
dz 

di^ PY 


where p = y/f, the ratio of the removal to infection r 
assume that x is approximatel 
epidemic can start to build u 


l density of susceptibles is n = p 4-y then the 
ve rise to an epidemic, after which the density 
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by the usual methods that the whole process can be characterized by the partial differential 
equation for the probability generating function П: 


әп * el oll 
ape Ww) aay PO) ==, (2) 
where П = 59955 (3) 
7,8 
with limits O<r+ts<nt+a, O<r<n, 0<5<т+а. (4) 


Equation (2) is substantially that given by Bartlett (1949, equation (49)), putting his 
Immigration rate equal to zero. 
Let us now use the Laplace transform and its inverse with respect to time given by 


4*0) = | отежа, RO)» ој 
0 


| (5) 
= ИСТУ 
$(0 = тё) ә à 
cic cetiw 3 ; 
Where | = lim | , and с is positive and greater than the abscissae of all the 
с 


—ico w>oJ сЁ 


residues, Taking transforms of (2) and (3), and using the boundary condition 

Рла(0) = 1, (6) 
We obtain (02 wo) IE pao) 07 — ATP + wre = 0, (7) 
and i П* = У ир, = ин, (8) 
where tn = = [ерада (9) 


Substituting (8) in (7), and equating coefficients of ите, yields the recurrence relations 
(r+ 1) (s— 1) дуда, 5-1 — (sr +p) F Aj dis + P(S 2A 1) Фу, в+1 =0 

and — (a(n- p) + A) ла + 1 =0, (10) 

With O<r+ts<nta, O<r<n, 0<8<т-а. 


Any drs Whose suffix falls outside the prescribed ranges is taken to be identically zero. It is 
evident from the form of the equations that, starting with q,,,, all the quantities q,, could 
be Calculated successively. Using the inverse of the Laplace transformation, we could then 
arrive at the re quired Ж exhibiting them as sums of exponential terms like e~*+)!, There 
Seems to be considerable difficulty in handling such expressions in a compact and convenient 
Way to give, for example, epidemic completion times or the stochastic epidemic curve 
Showing the rate of change with respect to time of the average total number of removals at 
any instant. However, some Progress is possible if we concentrate attention on the total 
Size of the epidemig 1o. "Ihi value ofn+a—% for + = 00. Аз i— o0 all terms in the expansion 
of 8,48) involving negative expon entials like c-/0*?' vanish unless i = 0. The non-vanishing 

erm is the coefficient of A7! in the partial fraction axpansinnef qu in termacf QU: pg] АР, 


12-2 
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i А 'obabilit: 
Now the epidemic ceases to spread to fresh susceptibles as soon ass = 0. Thus the probability 
of an epidemic of total size w (not counting the initial a infectious persons) is 


В, = lim p, qt) (0<w<n), 
t>o 


= lim Niana 
А>0 


І 


lim рд, „а putting r = n—w and s = 0 in (10), 
2-0 


(11) 
= Pew 

where fa = lim frs» | (12) 
for I<r+s<nt+a, 0<r<n, lescntal 


Р ; у b 
The quantities f. evidently satisfy the following recurrence relations obtained from (10) by 
writing f, for q, and putting A — 0, 


and 
with same limits as in (12). 
Some further simplification results from writing 


(+1) 6-05 ар) pis -1)f, guy = | (13) 
ап +р) ја "1 = 0, 


n! (ғ +p— 1)! puta—r—s 
р. = Ерни 


(14) 
7 Urs- 

87! (n+p)! 

Substituting in (13) gives 


rci s-1— rs + (r ЧР) а ал = 0 
and Ina = 1. 


+4 rind 
approach of considering 


tates represented by the points (7, 8). Thus the progress of к; 
epidemic can be regarded as a random walk from the point (n,a) to the points (n — 1 
ш = 0,1,...,n, with an absorbing barrier at у = 0, and where the possible transitions fro™ 
(r,s) are 


(7, 8)  (r— 1,8+1), oceurring with probability "(е +p), 


occurring with probability Plr +p). 


r P, can now ђе written 
€ probabilities of a]] possib] 


and (7,8) (r, 5 — 1), 


Foster's general formula fo 


down al i is simply bY 
considering the sum of th КОВ immediately 


6 paths from (n, а) to (n—w, 0). ТЕР 
we have 
т 
a+w 
Ван 20. № = 6 
W PEER Ab n+p Х(р--») HUBS b Mi, a ) 
w 
E the Summation is over ај compositions of 4+w—1 into 241 arts such that 
‚< — ; В 
д and 159, афр However, for th : "poses of com” 
putation there appears to be Some advant, » ‘or the purpos 


quantities P, from (11), (14) and (15), in 


5 е 
nis at all large, in calculating t” 
only is the form of ( 


f ad of from (16). The reason for this is that ev à 
°F Computation, but also the partitional nature 
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the summation may leave some doubt as to whether all relevant terms have been included 
in any specific instance. Using (11), (14) and (15), therefore, the P, have been calculated 
Over a suitable range of values of p, for n = 10, 20 and 40, and taking а = Таз a standard 
initial condition. Some typical results are shown in Figs. 1, 2 and 3. It can be seen from 
Figs. 1, 2 and 3 that when the relative removal rate pis large epidemics tend to be small, and 
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Fig. l. Diagram showing the probability of the 
final total size of the epidemic for groups of 
ten susceptibles, starting with the introduction 
ОЁ one new infectious case. 
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Fig. 2. Diagram showing the probability of the 
final total size of the epidemic for groups of 
twenty susceptibles, starting with the intro- 
duetion of one new infectious case. 


р=40 


30 40 


Final total size of epidemic (w) 


Fig. 3. Diagram showing the probability of the final total size of the epidemic for groups of 


forty susceptibles, 


starting with the introduction of one new infectious case. 
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conversely. There is a fairly gradual transition between the two extremes, though for rcs 
intermediate values of p most of the probability is accounted for by the two ends of t = 
distribution. For example, with р = 5 for n = 20, there is а 20 % chance of no additiona 

cases and a 64 % chance of 19 or 20. Again, there is only а gradual drop in the average e 
of the epidemic with increasing p. Specimen values for the range p = 0 (0:25) 1-50n are ве 

out in Table 1 below. There is no obvious analogue of the Threshold Theorem derived by 
Kermack & McKendrick (1927) for the deterministic case. 


Table 1. Average total size of epidemic for various values of pandn 
(not counting initial case) 


р n=10 n= 20 n= 40 
0 10-00 20-00 40-00 
0-25n 7:13 14-40 29.12 
0:50% 4:33 7:97 15:31 
0-75n 2-74 4:32 6:94 
1:00». 1:89 2-62 3:58 
1:25n 1:38 177 2:18 
1.50% 1-08 1.30 1:50 


4. HOUSEHOLD DISTRIBUTION OF CASES 
The methods of the foregoing section may also be employed to investigate the distribution 
of multiple cases of a disease in households. This problem was first examined statistically 
by Greenwood (1931) who considered the hypothesis that, with a fairly infectious disease 
like measles, the first case in a family would arise from an outside contact while subsequent 


ough contacts within the family. The period of infectiousness is thought 
sles and if reduced, for the 


advantage, for simplicity in handling th 
in (15) to give Irs аз а, linear function of ; 
are easily found to be E rx 


п-т 
Irs == Tpy-iÀ Фра (s 5 1), 
да = Gro] (7 +p), (17) 
Фа = 1. 


with 


and 
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Table 2 


m=i: P,-p[(p-1) 0 =аја I,=N/p(p+1)* 
P, = 1/(р+1) 
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n= 2: Py=pl(p+2) 
P, = 2p*[(p-- 2) (p+1)? 
P, = 2(2р+1)/(р+2) (p 1* 
dL  aj42a,, 2a, Жауға) N 
др“ p  3p*l р+1 p*2 


n= 3: Po=pl(p+3) 


dL _ а,+9ау+3а„, 2а, (150+ 24р +8) аз _ З(аг +а5) _ (а, +а,+аз) _ N 


dp^ p *?р+3 ps4 l2p2+8p+2 — pl mid ni 


®=4: Рә=р[(р+4) 

P, = Ap?[(p +4) (р+3)* 
P, = 129*(2p + 5)/(p +4) (р +3)? (P +2} 
P, = 24p (5p? + 27p* + 41p + 21)/(p + 4) (p - 3Y* (p--2Y (0+ 1 
P, = 24(14p° + 93p* + 235p! + 2039? + 197p? + 74p + 12)/(p + 4) (p+ 3* (0+ 2) (p+ 1)! 
dL _ ад+3а + За, +4ау аз. (15p?+ 54p + 47) аз 
do Pp 3p5 5р3--97р%--47р--27 

+ (8405 + 465p* + 940p? + 879р° + 394p + 74) аа 

1p + 93p* + 235р* + 203p? + 197p* + 740+ 12 

4(а,+а,) _З(а; +2 + аз) _ 2(а,ғаз жазға) | N 
p+l р+2 p+3 рға 


Шы. ы. <<. im ада Жы == " 


п-б: Py=pj(p+5) 
P, = Bp*[(p-- 5) (р + 4) 
Py = 2002p + Т)/\р +5) (р +)" (0-3)? уре 
Р, = 60p'(5p? + 42p? + 116p + 106)/(p +5) (0+4)? (0+3) (p +2) 
120p5(14p*-+ 1778 + 910p* + 2443" + 3626p° + 2836p + 918) 
Py прав) (p+ 4)? (р +3) (0+ 2)! (0+1) 4 
190(49р1о-+ 50Gp9 + B604p* + 12,240p" + 25,941p° + 36,144? 
+ 34,061р*+ 21,9525? + 9456p? + 2448p + 288) 


ака (p45) (p-F 4» (p 3) (p-- 2 (0+1) 
а; m бр дб нш ба: Boa, БОЗ Вар ба 
gm = A C goa брз + 42p*+ 116p + 106 
(84p5 + 885p* + 3640p" + 7329p* + 7252p + 2836) ay 
34 3626p? + 2836p + 918 


140% 177р5 + 910р* + 2448p 


(420p? + 53645? + 28,832р + 85,680p° + 155,646р5 + 180,720р* 


+ 136,244p° + 65,8560? + 18,912p + 2448) a; 
8+ 12,240p? + 25,9415 + 36,144p° + 34,061 p* 
-- 21,952? + 9456p? + 2448p + 288 


+ 42519 596p? + 3604p 


4(ay--a,-- а) _3(аа+а,+а+) 


5(a +45) 
pti ч р+2 р+3 
Ф(а,+а,+а;+@+)__М 

E р+5 


ре 


184 The total size of a general stochastic epidemic 


On the other hand, with values of n as small as 1 to 5, it is probably just as easy to derive 
the P, straight from Foster’s formula (16). пр 
Исево expect to be able to estimate // and у separately as the asymptotic distr ibution 
of epidemic size for infinite time yields no information about the time scale. For this у + 
should require data giving the time intervals between successive infections in families with 
two or more cases. | В 
Having calculated the P, for any given family size, we can throw the results into a for Е 
suitable for the maximum likelihood estimation of р. N is the total number of families 0 


~ = ahaa e ce die A 
a given size; and a, is the observed number of families with a total of w cases in addition t 


the first one. The case n = 013 trivial. Forn = 1 there are simple expressions for the amount 


i i i " > : : CONS ion 
of information as well as the maximum likelihood estimate, while for n > 2, the informatior 
functions become increasingly awkward to handle, and the simplest procedure is to use the 


well-known method of calculating the observed amount of information from sufficiently 


close values of the score. The values of the Р, and the corresponding score for n = 1, 2,3,4 
and 5, are set out below in Table 2, 


The values of P,, can be conveniently checked by ensuring that their sum for any 


nis unity. Although the scores for the larger n contain some 
in p, there is little difficulty in practice, with the aid of Bari 
machine, in computing the score at a few trial values of 
polation. However, if such methods were to be used 
while considering the construction of Speci 
observations and the coefficients of these о 
a wide range of values of p. 

Suitable data for the application of the above methods do not 
any rate are not readily accessible, apart from the materi 
epidemic used by Greenwood (1931) 


given 
awkward looking polynomials 
low's Tables and a calculating 
p for the purposes of inverse inter- 
at all extensively it would be worth 
al tables. The scores are linear functions of the 


bservational quantities could be tabulated ove? 


Ж "аб 
seem to be available, oF 2 


s discuss 
ther hand, + 5 à lofthe types 


(1931), appropriate to very short periods of 


high infectivity, 
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of epidemiological data. An application to the problem of the distribution of multiple cases 
of disease in a household is also considered. It is shown how maximum likelihood estimates 
of the ratio of removal to infection rate can be obtained from suitable data, and the 
appropriate maximum likelihood scores are given for families up to a total size of 5 (not 
including the first case). The model under discussion is not suitable for diseases like measles 
involving short periods of high infectivity, but its adequacy for other infections requires 
to be tested, 


Tam indebted to Miss Eva Rowland for undertaking the computations, on which Figs. 1, 
2 and 3 and Table 1 were based. 
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EXPERIMENTAL EVIDENCE CONCERNING CONTAGIOUS 
DISTRIBUTIONS IN ECOLOGY 


By D. A. EVANS 
King's College, Newcastle upon T'yne 


INTRODUCTION 


In order to get evidence about the applicability of theoretical contagious distributions, as 
much data as possible on plant and insect populations have been collected together. The 
goodness of fit has been considered for three contagious distributions. They are the negative 
binomial, the Pélya-Aeppli, and the Neyman Type A. The general conclusions are that, оп 
the whole, plant quadrat counts are fairly well fitted by the Neyman Type A distribution, 
while insect counts are fitted by the negative binomial distribution and not by the other 
two distributions. It is suggested that this difference is possibly due to a greater degree 
of competition and overcrowding in the case of plants. In Part I experimental evidence 
is given and discussed. Mathematical formulae and charts required are given in Part п. 


Parr I. EXPERIMENTAL EVIDENCE 


11. Description of data, 


The plant quadrat counts and insect population counts analysed in this paper are arranged 


at eight British localities are given in Table 1, while 


( ree types of American prairie. Some counts of insects 
and larvae are given in Table 3 and two counts of moth eggs are given in Table 4. The 


, 


various sources from which these а. 


(i) Counts numbered 1-24, Table 14, were made by Archibald (1948) and consist of 


ies. The type of community in, 


-30, have been given by Barn 
niform level expanses of virg 
› the waste product of clay р: 
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Table 14. Frequencies for various plant species 
Number of 
individuals la 2a 3a 4а ба ба 7а 8b 9b 105 
0 4 12 15 39 57 82 60 26 == 45 
1 3 8 17 23 6 4 18 36 = 25 
2 8 9 28 17 12 4 19 27 3 ll 
3 13 13 18 13 5 3 2 10 2 4 
4 11 6 9 4 5 1 == = 5 8 
5 9 8 12 4 5 2 = = 8 3 
6 8 11 == = 7 1 1 1 6 1 
7 10 7 = 1 1 = = 1 1 
8 3 8 1 — — 13 — 
3 3 7 - 1 1 10 2 
10 8 3 1 1 5 ve 
11 3 4 -— = 8 
12 1 1 8 
13 4 1 8 
14 == Tz Е 
15 3 = 3 
16 9 1 6 
17 1 = 6 
18 = - 5 
19 = 1 1 
20 - T 2 
21 m a 1 
22 Е ЕТ Ез 
93 2 == 1 
24 = = В 
25 = = 
26 = = 
27 == iuh 
29 Ens = 
30 1 РЕЖ 
31 a = 
34 = = 
N 100 |100 |100 | 100 | 100 100 |100 
hy Et шт” 100 а | 132, 158| 0-67] 068| 1-26] 1059 | 137 
hy 27.56 | 1453 | 268| 2:00) 542 3:37 | 103| 112| 2253| 3-71 
ks 226-81 44-52 2-51 9.61 | 18-85 | 20-79 2:07 1:21 31:08 | 14-00 
Method 1. у 0-51; 243| 404| 0-51) 0-00 1-18 | 171 
Ty = heme ike Er _ 1-27 | —8-22 | — 4:37 | +0-08 | — 0-05 | — 35-5 | — 0:39 
8x, (T) UM 19 12 | 1 тб | 9 0.5 | 0:35} 30 4 
Tra abe |gs | 007 | –100 | 856) +100) +017 —988 | +1-61 
848, (Th A) 44 17 12 1 5-4 4 0-45 30 3-2 
Мей osol 362| 475| 0:66 1:48 
ры жар _0.38 | —1-88 | —0-48 | —0-10 ЕКЕ 
ва, (U, ) 0-29 116| 0:84 0413 0-44 
аз, E 2-09 0-79 2-60 3:24 0-60 1-20 
09%, i ls — 1:05 — 0:27 | — 0-27 +0-53 | — 0-06 20-70% 
S.E. (бы) + 3-0 2-0 0-25 0:64 0-34 0-1 0:34 


№ а 
Ote. Tn all the tables an asterisk ag 
esti tude than twice its estimated stan 


ated standard error. 


ainst th 
dard error: two 


e value of a test statistic indicates that it is greater in 
asterisks denote that it exceeds three times its 
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Table ТА (cont.) 


Т | 
Number of Р ә 
individuels | 115 | 12b 13е | Ме | 158 | 16e | 17] 18/ | 19/ | 20g | 219 
| | 
0 75 | 75 1 | 274 | 88 | — | 181 | 237 |354 | 52 18 
1 5 10 15 71 101 - 118 10 58 32 7 
2 9 3 27 58 101 — 97 78 41 13 9 
3 6 2 42 36 | 84 1 | 54 30 25 3 n 
4 == 4 77 90 54 2 32 23 11 - 5 
5 3 2 77 12 30 3 9| 8 3 8 
6 = 2 89 10 16 2 5 6 4 5 
7 1 1 57 7 16 6 3 3 3 9 
8 1 = 48 6 7 7 1 = = 5 
9 = 24 3 2 6 4 = 
10 E 14 = 1 8 1 1 1 
11 1 16 2 10 
12 9 1 4 7а 
13 3 5 1 
14 1 2 5 
15 7 E 
16 7 о 
17 2 4 
18 5 1 
19 2 1 
20 3 - 
21 2 
де 3 | 
23 5 
24 2 
95 = 
26 
3 
27 z 
29 
30 : 
31 1 
34 T 
ess 
N 100 |100 : 
М tal og |, 0 "s m 800 |500 500 0100 a 
k 2:35 5 i | ў 41) 1:18 0-66] 0: .64 
ks 9:67 | 1930 788 | 1835 | 401 4639) 225) 516 175 | 0-67) 2581 
18-34 7-36 | 250-47, 3-99 9-36 6-37| 0-56 
| 
Method 1:4 | 2.31 3-34 0-11 р = 14 
Ts. -L64|-239 |.018 124 | 065] 226) о5о| 1.29 1-66 | 0:00) 3 
в.) 35 | 7 18 | 27 EC *3041|-069|.-068 | 0.26 | — 0:11 а 
М.А. +0:25 | + 1-96 — 0-14 | 4.1.7 i 5 0-6 ld 0-89| 0-13 
М "13 |-0- _16 
S.E. (Txa) | 2 4 us | їз “ie +668 —044 +166 | 40-64 m 
83 05 | 09 0-6 
А ғы 
Method 9: 49 | 9.94 3-42 66 
0, —0-45 -0.06 2:34 4.6 
в.в. (Up) 0:50 | 0 — 0-14 | 0-78 | 1.17 1:81 8.67 
Qu 64 0-29 =0:26| +0-14 | 0.10 T 4.95 
ар, 219) 2-49 > 0-14 9 
U Й 1:89 0-71 0:15 0:13 41 
N.A. +0-08 | +0-67* 0-70 | 0 
S.E. (U. Ў +0-55* | 0.73 70| 1-00 1.47 0:3 
(Ux) 0-26 0:31 | —0-15 - 
019 | 0.93 | 5| +0-34** | 40.13 3 
| 0-12) 0.1] 0-08 
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Table 1А (cont.) 


Number of 
individuals | 22g | 23g | 249 | 25h | 26h | 27h | 28h | 29% | 30h 
0 12 58 65 101 124 41 
150 
i 22 32 18 25 76 14 34 19 5 
: 19 10 10 26 33 5 28 9 11 
1 17 2 5 36 17 2 8 2 6 
15 3 2 18 2 1 4 2 
5 6 = = 21 — -- 2 1 : 
6 5 6 m 2 = 
7 2 4 1 : 
8 2 6 E g а 
9 — 5 
10 3 
11 1 
М 100 100 100 252 252 64 228 100 100 
a 2-61 0-70 0-61 2-22 0-80 0-64 0-67 0-62 0-63 
E 3-69 0-90 0-99 6-55 0-93 144 1-33 1-29 0-94 
a 5:06 141 1-64 | 19-98 0-97 5-31 3-28 3-64 1-33 
Method 1:4] 042 0-28 0-62 1:95 | 0-16 1:25 1:00 1:08 0-50 
ыл -1-48 0:03 |-045 |-Т785 |-024 0-77 |-0:38 |-007 | —0-47 
T. (Ту) 2 0-32 0-52 4-7 0-19 1-25 0-66 1-00 0-43 
Тум. —196 |4006 |—033 |—364 | —0-23 1.27 |—0-05 |+0:29 |—0-39 
EL (Тура) 2 0-3 0-48 4 0-19 1-20 0-5 0-7 0-4 
М à 
Tothoa 2:49 0-21 0-83 2:86 0-88 1-18 0-98 0-73 
Ue, 4006 |—013 |-203% +0-24 |—012 |40-06 | —0-15 
P (Су) 0:08 0:14 0-79 0:18 0:13 0-16 0-13 
D 0-43 0-20 0-74 2:15 0-24 0-78 1:02 0-86 0-66 
ска, 2004 10:06 |—007 |—044 -096 | +0-30* | -001 | +013 |—010 
S.E. (Ux 4.) 0:54 0-07 0-11 0-49 0:06 | 0415 0-1 0-12 0-1 


Table 1в. List of plant species and localities referred to in Table 14 
Plant Species 


la Salicornia stricta 16е Salicornia stricta 

2 Plantago maritima 17f Carex flacca 

3 Limonium vulgare 18 Briza media 

4 Triglochin maritima 19  Helictotrichon pratensc 

5 Armeria maritima 20g Thymus serpyllum 

6 Festuca sp. 21 Festuca sp. 

7 Suaeda maritima 99 Bromus erectus 

8b Glaux maritima 23 Carex sp. 

9 Juncus gerardii 24 Briza media 
10 Aster tripolium 25h Hypnum schreberi 
11 Triglochin maritima 26 Spergularia rubra 
12 Plantago maritima 97 Juncus effusus seedlings 
13e Glaux maritima 28 Juncus effusus seedlings 
14 Plantago coronopus 29 Juncus tussocks 
15d Carex arenaria 30 Juncus tussocks 

Localities 
Point e Salicornia Salt Marsh, Blakeney Point 


f Limonium Salt Marsh, Blakeney 

Juncus Marsh, Havant 

© Glaux Low, Blakeney Point 
Carex Dune, Blakeney Point 


Chalk Grassland, Pitstone 
g Chalk Grassland, Otford 
h The Devonshire Moors 
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Table 10. Classification of the habitats sampled in locality h, Table Тв, together 
with the size of quadrat used 


25 L, D, Н, 39 sq.cm. 28 L, H, 600 sq.em. 
26 I, D, Н, dense, 39 sq.em. 29 L, L, 2500 sq.cm. 
27 I, L, 156 sq.em. 30 L, H, 2500 sq.cm. 


IlorL Initial or late stage of colonization. 
W or D Wet or dry. 


HorL Highland or lowland areas. 


Salia atrocinerea, the underlying vegetation was completely obliterated by the mica deposits. 
In view of the wind dispersal and the uniform surface one would expect a Poisson dis- 
tribution for a quadrat count of willow seedlings, and this was in fact borne out by à sample 
of sixty-four quadrats of area 156sq.cm., for which the mean number of seedlings per 
quadrat was 0-734. The water used to transport the mica in suspension was derived from 
moorland streams and probably carried in the separate protonemata of the mosses. Each 
protonema then gave rise to a number of upright shoots, and counts of the latter gave 
evidence of contagion, as shown in Table 1А, species numbered 25%. As soon as the dams 
dried out, the herb Spergularia rubra was found. The count given (26h) was taken on ground 
bare of parent plants. It seems likely that small aggregates of seeds were carried in by 
animals, which frequented the dams as soon as they were firm enough. The rush, Juncus 
effusus, was one of the earliest colonizers, and was also one of the most, persistent. The 
count of Juncus seedlings (28h) was made on an older dam with numerous well-established 


but widely spaced clumps of rush, from which seed was possibly distributed by wind апа 
by grazing animals. 


names of the species, » which gives the letter code, used by Steiger, for the 
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i i unts 
Table 34. Frequency distribution for eleven insect population cor 


(data given by Beall) 
| 
| lle 
10с 
5 8c 9c 
ber of 5b 6b 7b 
individuals la za зе es ” -| ы 227 
5 62 ae 
117 205 1 70 
Р 7 190 20 33 : 88 н 
|| | | ame | nja jajaj g 
1 is 16 17 27 | 304 a H 38 4 p 1 
3 18 з | M 9 qd 205 6 5 21 2 > - 
Я 11 15 5 7 204 1 E 7 — = 
5 12 9 4 3 215 1 2 2 | 
6 7 6 1 1 199 2 2 2 
7 8 5 2 1 | 150 2 2 Ra 
8 4 3 2 a 104 ы == 1 
в d p Em = 90 = 
1 1 = 
10 1 3 1 n 2 Еа 
1 —. | — 1 2d 1 = 
12 1 1 Түгі 36 T == 
13 1 — 16 2 = 
и e- 15 1 == 
15 1 11 1 = 
16 - 6 = 1 
17 1 | 10 3 ES | | 
18 - < а => 
19 1 2 
| 
20 = 1 а 1 | 
21 — 1 
22 — 3 == = | 
22 2 өзі | 
2 Е: 1 — | 
25 — 1 — 
26 1 | — E | 
27 - = | 
28 1 => | 
36 = } | | 
| 45 | 
60 1 | 
= | " = 325 
25 
N 120 120 į 120 |190 |2304 70 70 325 325 3 у ой 
қ 8 1 
ky 403 3-17 1-48 1-51 4-74 6:10 2:14 1:40 0:50 a 0,09 
ГА 16-45 7.77 3-19 3:63 | 15-00 | 113-13 | 14-10 2.33 0:58 1.54 
% 155-11 | 19-75 9:67 | 15-63 | 83.83 |3787-50 | 162-75 | 179 0-79 


Treatments. Counts 1-4 were mad 
in grams per acre. 


Count 5 was made 
the treatments give 
ments, as follows: 


Table Зв. List of insect species and treatments referred to in Table 34 


Application Application 
сш on 8 July on 19 July 
0 0 
2 0 40 
3 40 0 
4 40 40 
оп a potato field near Chatham, Ontarj 
n to the beetles in counts 6 and 7. p Tres 


Insect species: а 


b 
c 


8 No spray 
9 Contact insecticide 


The E 
The C. 
The b 


uropean corn-borer, 
olorado potato beetl 
eet webworm, Lozosi 


e on plots given two appli 


10 Leaq arsenate 


ts 8- 


Pyrausta nubilalis Ноа. 
e, Leptinotarsa decemlineata Say. 
tege sticticalis L. 


cations of fungus spores, at the followin: 


son ig given 
gust 1935. No information is giv% tro 


11 were on plots given various che™ 


Contact 5pray and lead arsenate 


g jovel® 


id 
yu 
в004. 
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40dm., in each quarter. The vegetation is described as ‘mixed prairie’, and A. Smithii is 
one of the two dominant tall grasses found on the plateau. 
This concludes the description of the plant quadrat counts. 
" m у Passing on to the counts of insect and larvae populations, counts numbered 1—4 and 
-П in Table 34 have been given by Beall (1940). Опе of the series of counts had to be 
excluded because it was not given in full. Table Зв gives the key to the species letter code, 
and the key to the number code for the treatments applied to the insect populations, 
Counts 1-4 were made in an experiment (Stirrett, Beall & Timonin, 1937) on the control 
of the European corn-borer by the fungus Beauveria Bassiana Vuill. The fungus treatment 
8 J uly occurred at the beginning of the period of oviposition and the second treatment 
one height. The borer lays masses of about twenty eggs, which hatch in July and reach 
growth in August, so that the population sampled was effectively of one age. Counts 


Table 4. Frequency distributions for two counts of the number of moth 


eggs per maize plant (M arshall’s data) È 
Nu " Frequencies (п,) | n, Statistics 
of mae (r) j | 2nd count * 
Cd por еу анов. соп- ME Г 
plant E 
1st count | 2nd count | tinued Counts 15% | 2nd 
b | L 2221 В _ | | | 
0 | 204 | 101 19 | 1 N 780 | 782 
1 | M3 | 72 20 3 ky 2-48 5-75 
2 | 198 | 89 21 1 Ёз | 7:40 39-10 
: 107 84 22 2 ky 39-43 383-55 
5 a FA | a 1 Method 1 
6 32 | 56 | 25 - ám | 198 | 4-58 
T 1 | 43 | 26 1 Ts: 42:67 | 457-25 
8 14 дб | 87 1 S.E. (Tu) 44 | 404 
9 7 28 | 28 Es Ty | -T55* | +117-62* 
10 7 39 2 | 2 S.E. (Tr) | 93 28 
m > |» || = Method 2 
12 3 | 17 | 31 | E etho | 
13 3 12 | 32 1 49. 2:08 5:07 
14 1 14 | 33 — Ux, – 0:93 —2.8 
15 1 12 34 3 в.®. (Оуљ) 0-49 2.6 
16 1 10 — — Ха 11:9 40-3* 
17 | 2 | nmn 51 1 D.F 9 18 
18 | Lh | 7 — — % 9-5 34-2* 
| | | аҙ 1-70 3-62 
| | Ur. + 0:71" + 551 
| | | в.Е. (Ор) 0-34 1:5 
а | | | 


Were made of the number of borers on the unit area occupied by a hill of corn, and there had 
беп two possible periods of migration of the larvae prior to the examination on 19 October. 
The Colorado potato beetle lays groups of 20-30 eggs, and the two counts 60 and 70 were 
Made on the number of larvae per 4ft. strip of potato row. The population sampled ranged 
Tom larvae newly hatched to completely mature larvae. 
The beet webworm lays from one to five eggs, and the larvae usually mature on the plant 
Оп which they have hatched, although they can move about freely. Counts 8c-11c were 
nade on the number of larvae present on unit areas of 3 ft. of row, and the population con- 
Sisteq mainly of mature larvae with some half-grown larvae. Beall suggests that the contact 
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insecticide, acting as an irritant, might possibly increase dispersion, whilst the arsenate 
i d to depress it. 
е, кү has n given an account of some earlier work on the генде ak pomi 
beetle. A plot of potato plants forty-eight rows wide and 96 ft. long was chosen = T 5x 
ination. By running strings transversely to the rows of potatoes at intervals of = n: п 
area was split up into 2304 sampling units of 2 ft. lengths of row. The frequency distributio 
of the number of beetles per unit is given in Table ЗА, number 56. | 
(vii) Finally, Table 4 gives two frequency distributions obtained in a field ва 
study, by Marshall (1936), of oviposition by the moth, Heliothis obsoleta Fabr. Two counts 
of the number of eggs per maize plant were made for all the maize plants in a plot 23 by 
24 yards. All the eggs found during the first count were destroyed, so that the second count, 
which took place a week later, was independent of the first count. 


1-2. Analysis of data 


id rious 
The purpose of the present analysis is to see whether one can choose between variou 


theoretical contagious distributions and say that, for a particular type of data, one form of 


distribution gives a better fit in general than the others. No claim is made that the typical 
form of distribution will always give a good fit. Individual cases will certainly arise where 
the fit is poor, and without further data it is impossible to say whether poor fits are 4 
persistent feature for counts of certain species. 

Because of the poor discrimination between alternative theoretical distributions usually 
shown by plant and insect population counts, it seems necessary to examine ава whole 
a large number of counts of each type, as many as one can get in fact, with no personal bias 


in the selection. The problem is then to detect whether one theoretical distribution always 
tends to err in a particular direction 


› Whilst another gives a satisfactory median fit. 

The Neyman Type A, Pólya-Aep 
fitted according to the methods giv 
chosen as being typical of the vari 


D. A. Evans 197 


fits insect population counts. The explanation of this difference may be that in some 
manner the two types of population are differently affected by overcrowding and by com- 
petition. For the plant and insect populations considered here, the plants are larger in size 
on the average than the insects, and they may therefore be more prone to suffer from over- 
crowding, in the sense that their physical size prevents high densities per unit area from 
Occurring. Competition by other species may be relatively slight in the case of insect com- 
munities on plant hosts if, as seems likely from the relevant literature, only a small number 
of insect species are able to tolerate one kind of plant host. 

The population counts discussed here do not show any indication that a more skew form 
of distribution than the negative binomial distribution would give a better fit. However, 
in the course of searching the literature for insect population counts, four series of insect 
Parasites on animal hosts were found, which showed a greater degree of skewness than could 
be fitted by this distribution. It may be possible to discuss these counts in a later paper, but 
So far no more excessively skew series of this type have been found. 

Referring back to the thirty plant counts given in Table 1 4, it will be seen that only a few 
of the values of the statistics 7 and U (marked with an asterisk) differ from zero by more 
than twice their estimated standard error. Taking the group as a whole, the Neyman Type A 
distribution (N.A.) gives a satisfactory median fit and the Pélya-Aeppli distribution (Р.) 
Blves a significantly high proportion of negative values. Values of T and U for the negative 
binomial distribution (N.B.) are not given, but they also show a significantly high 
Proportion of negative values. 

In order to investigate the general spread of the statistics T and U for the Neyman Type A 
and Pélya-Aeppli distributions, the value of the more efficient statistic for each count was 
divided by its standard error and converted by a table of probits into a probability value. 

his use of a probit conversion is only strictly valid if T and U are normally distributed and 
the Standard error precisely known. One would expect, however, that the error committed 
18 not serious, It may be as well to stress here that all the theoretical results used in this 
analysis are based on large-sample theory. On the null hypothesis that the distribution 
tted is the correct one, a rectangular distribution in (0,1) would be expected for the 
Probability values. After dividing the interval into five equal parts, this hypothesis was 
tested by the 3? goodness of fit test. The values of x? (with 4 degrees of freedom) were 5:7 
апа 12 for the Neyman Type A and Pólya-Aeppli distributions respectively, so that the 
atter distribution does not give a satisfactory fit. к . А a 

The conclusion, that the Neyman Type A distribution gives a satisfactory median fit, 
May be verified by calculating and comparing expected and observed group frequencies, 
and this has been done in those cases where there are a large number of observations. The 
Tesults are given in Table 5, using the same key as in Table 1a. The Neyman Type A dis- 
tribution was fitted by using the sample mean and proportion of zeros (Method 2, Part II 

elow), except for count 13c, where the fitting was by moments (Method 1). Expected 

frequencies for the Pólya-Aeppli distribution fitted by Method 2 are also given for count 146. 
he value of үз for the goodness of fit test and its number of degrees of freedom are given 

at the foot of NE count. The total of 53-8 for y? with 42 degress of freedom is not significantly 
агре, nor is there any consistent failure in fitting at certain points, so that the conclusion 


Elven a i i ited. я 
Беа at for count 14c, the x? test is sufficiently sensitive 


It may b ing on th 
ted before passing LL | 4 
9 “њи у а а two forms of distribution—the result agreeing with the test 
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based on the U statistic. The U test is also able to discriminate between the two forms of 


distribution for the count 18f, but here, as is more usual, the x? test fails to do so. 


Table 5. Expected frequencies for the Neyman Type A distribution fitted to some 
plant population counts from Table 1 


13c 14c 15d lif 
Number of | - ——— 
individuals ur кендеген У | ri 
Obs. | Exp. | Obs. | М.А. | Р.А. | Obs. | Exp. | Obs. | Exp. 
Г 0 | m 19.8 | 274 274-0 | 2740 | 88 | 880 | 181 | 181-0 
1 71 58-0 750 | 101 | 105-0 | 118 | 1271 
2 27 | 284 | 58 | 589 | 515 | 101 99-9 97 | 890 
3 42 | 513 | 36 | 436 | 344 | 84 78-2 54 52.0 
4 77 | 707 | 20 | 976 | 227 | 54 | 539 | 32 | 973 
5 77 | 796 19 16-4 14-9 30 33-8 9 13-2 
6 89 | 758 10 95 | 96 16 19-7 
" T 57 63-1 7 54 6-2 16 10-8 | 
8 48 | 46-7 Т 5-6 | 
9 24 31-2 | 9 | 104 
10 14 | 191 12 6-4 10-8 | 
$i 16 | 108 3 52 
12 апа оуег 18 10-7 | 
| 
ра | | 
2 500 500 500 500 
ж 12-2 128 | 24 zs 3-8 
D.F. 9 6 6 7 4 
| жыл 1 
--%- 
18 19 
Number of / / 25h 26h 28h 
individuals ES T | — 
Obs. Exp. | Obs | Exp. | Obs Exp. | Obs Exp. | Obs Exp 
| Кы | NER 
0 237 | 9370 | 354 | d | 0 
1 110 | 1029 | 58 | gr | E | 5 E E Et 
78 73-8 4 е " | 10 77-8 3 .8 
: 30 | 428 эр | 368 36 ота | 38 | 387 x 11.0 
j \ \ 
5 s ne и | 126 18 20-9 
6 | 21 14-8 
7 | | | 6 10-3 
8 | 
| 10 , ў 
9 б 14 9T, п 10-6 Шы; 19 16-4 8 9 
10 
11 9 
12 and over E 78 
N 500 | | Lax 3 | | А 
Кк | | 500 | 252 
| uk | | i 0-7 | | 252 | 228 22. 
| | 3 іы | 0:5 2 
| 1 
Note. The theoretical distributio 


to whi Я ; 4; олова fitted 
о which a Pólya. Aeppli distribution was Er АШ Neyman Type A ee iaceo д © ount 
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None of the three contagious distributions considered here gives a satisfactory fit to the 
thirty-eight plant species referred to in Table 24. When the fitting is done by Method 1, the 
Neyman Type A distribution gives the best fit as far as third moments are concerned, but 
goes wrong at the lower end of the frequency distribution, predicting too many zeros in 
thirty-four cases out of thirty-eight.* The negative binomial distribution would predict 
about the right proportion of zeros, but would generally be far too skew. 

These conflicting results may be due to a number of causes. The communities sampled by 
Archibald were well-known natural forms of a more or less stable character and were chosen 
for their uniformity of life form. There were therefore no difficulties of sampling due to 
widely differing life forms. The degree of cover is more or less complete in the chalk grass- 
lands, and especially in the Salicornia and Limonium marshes, where the vegetation forms 
а low compact mat. The structure of the prairie vegetation showed great variation, with 
Several species at times locally dominant. The character of this community is entirely 
different from that of the communities studied by Archibald. Many of the grasses, the 
Andropogons and Boutelouas in particular, are sod (that is, tussock) forming with foliage 
from 30 to 65 cm. high. The formation of dense sods of Bouteloua is encouraged by the annual 
mowing of the area. Andropogon furcatus has a rank growth with widely spreading tops 
producing almost complete cover, yet it may occupy only 20 % of the surface area. In the 
case of the sod-forming species it seems certain that Steiger has given the number of flowering 
Stems per quadrat, but he is not definite about this. 

No experimental field work seems to have been published about the effect on the shape 
of the frequency distribution produced by using samples based on different quadrat sizes. 
Consequently it is not easy to judge the significance of the fact that Steiger’s quadrat size 
is 500 times as large as the 20sq.cm. quadrat used by Archibald, From the mathematical 
Point of view, the three distributions considered here are only additive with respect to the 
parameter m; that is, if two independent random variables have the same type of distribu- 


tion, but with parameters (701, а) and (ть, а) respectively, then their sum also has the same 


type of distribution, but with parameters (m4 + тоа). 

Although the quadrat size was so much larger, Steiger only found from ten to twenty 
Stems per square metre for each of the more abundant species, and about 300 stems for all 
Species combined. The corresponding figures obtained by Archibald were one to two and 
fifteen, so that the English communities had about twenty-five times as many individuals 
Per unit area, with fewer species having about the same, high, average number of individuals 


Der unit area. 

It is possible that the Neym 
that the annual mowing and the 
individuals of each species more evenly th 


Observed. | . mos | 
+ the negative binomial distribution is the appropriate 


An alternative conjecture is tha го 
distribution, and that the upper tail has been compressed because any high densities are 
; вресіев. 


Physi а large, rank-growing, 
ir wid n nw of zeros and E(n) is its estimated expected value obtained 
0 


by fitting the distribution by moments, then (по) – no = V say, and U = k,—k,(1+é) 


an Type A distribution is the correct form of distribution, and 
higher number of equal competitors have spread out the 
an usual, thereby reducing the number of zeros 


* Le. thirty-four out of the thirty-eight values of Ел (По) — По Shown at the bottom of the table are 


Positive, 
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have the same sign for any of the three distributions considered here; and further, since 
Оза > Up, > Uy p, (4 > 0), it follows that 


Бул (по) — no > Ep (по) — no > Ex в (пр) — 7. 
This means that the statistics U and V can be regarded as equivalent as far as the study 
of the proportion of positive to negative signs is concerned. Thus if thirty-four positive 
values and four negative values of V are found for the Neyman Type A distribution, thirty- 
four out of the thirty-eight values of U would also be positive. 
The statistics Up, and Ux 4 are given for comparison for the nine low-prairie, non-grassy 


species. Their estimated standard errors have also been given, although for М as low as 
40 they will not be very reliable. 


Table 6. The Neyman Type A distribution fitted to five high-prairie, 
non-grassy species from Table 2 


4P БЕ 6H 7Cs 8Am 
Number of 
individuals — + | | 
Obs. | Exp. | Obs. | Exp. | Obs. | Exp. | Obs Exp. | Obs. | Exp 
l g $ | 80 | 35 | wa | 80 | 200 | irl зао | s | 0 
| 1 12 7-7 » 10 7-4 5 2.9 
2 1 T8 12 10-5 | 11 7-4 6 6-0 4 ra 
3 1 3 
% | 7 | пә } т | 19 } 4 | | - i 23 
бала оуег | 8 T3 6 66 | 5 | | 5 58 
1 — 
x ал 0-4 Я == 
| DF 2 1 ? У di жел 
| ЕЕ и Me 
| Usa. Tem +4 +89 42:9 +18 


bothind: : “he Pólya- Aeppli distribution give? 
tion mA * better fit would be given by a more 
ounts given in Ta 
Satisfactory fit t 
distribution and 


Р ivê 
ble 3.4 will now be considered, The nega” 
© these counts, 


р Jls 
the Pólya-Aeppli distribution (which he © 
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Pélya Types 1 and 2 respectively), he got his parameters interchanged and was consequently 
unfair to them. Furthermore, his method of estimation is always by the first two moments, 
and this method is seen to be particularly inefficient in the case of the Colorado potato beetle 
counts (5b, 66 and 70). 

In addition to the 7 and U statistics, extensive calculations of expected values were 
carried out for the distributions in Table 3 А. Expected values were calculated for the negative 
binomial and Pólya-Aeppli distributions, fitted by both methods, but restrictions of space 
Prevent the presentation of this information in full. Table 74 gives the values of the fitted 
Parameters and of X? for both methods of fitting. for both distributions. None of the values 
of x? is significant for the negative binomial distribution, both methods of fitting giving 
reasonable values of л. The total ү? of 55 for the preferred method of fitting has 50 degrees 


Table ТА. Statistics for the negative binomial and Pólya-Aeppli distributions fitted 
by Methods 1 and 2 to the insect population counts of Table 3 


- - 
Count. la | 2a 3a 4a 5b 6b 7b 8c 9с 10c lle 
120 120 | 120 | 120 2304 70 70 325 | 325 | 325 325 
{ | Ж |» 
аш 3-08 | 115| 141 2.17 17:55| 558| 0:66) 016| 034| 0-26 
Тұ, | 4373 |—106 |—09 | +18 | +38 |—296 | —86 | —069| –0-05 | —0-37| —0-07 
S.E. (Ту 36 8-8 30 | 41 55 1200 89 0:80| 0:09] 0-28) 0-11 
M 6-8 m 1.2 то | 211 37 019| 52 143| 42 0:33 
De 7 5 3 3 18 4 2 4 1 2 1 
аз, 3-07 242| 101| 143 2-22 1277| 520| 0-82] 0-20] 0-48] 0-31 
Оқу, +00 | —3-1 |4032| —0-03| -095 | +291 |+08 | —0-22| —0:02| —0-12| —0-02 
8.8, (U) 2-7 1-8 0411 0-56| 069 25 | 37 0:20} 003| 0-08} 0-03 
2 6-8 4-9 1.3 71 21-6 14 0-08| 53 151| 37 0-11 
Der, 7 5 3 3 18 4 2 4 1 2 1 
ne - 
| 
T 565*| —T. 0-1 | +33 | +14-9* | +643 24-7 | —0-38 | —0-04 | —0-32 | —0-06 
В.Е. (Ту Би uU, TES 31 m 1060 49 0-75| 009| 026| 0-10 
x 8-6 3-6 2.8 69 | 41-9 505%) 81*| 38 1:30] 3:5 | 0-20 
Dw, 7 5 3 3 18 3 2 4 1 2 1 
dp 9.38 1:04 | 0-89| 1:22 1:80 тла| 370| 0-74] 0-20] 0-45] 030 
Us. 428 | —15 | 40-39 | 4028] +176*| +59%8*| +40% | —0-11 | —0:02 | —0-10 | —0-01 
З.п. (U) 1-9 12 0.32| 039| 0-48 59 1:9 0-16) 0-03) 007) 0-03 
x 50 24 22 57 | 459* 10:3* | 2.0 37 134| 2.9 0:04 
D.F, 7 5 3 3 18 4 2 4 1 2 1 
| 


н Note. For the cases marked f, Method 1 is expected to be the more efficient method of fitting; for the 
“maining cases Method 2 is preferable. 


9f freedom and is not significantly large. Two values of x? are significant for the Pélya- 
Aeppli distribution (counts 55 and 65), and the total y? of 77 on 50 degrees of freedom is 
Significant at 1%, If, however, the large significant д? for count 56 is removed, the total 
Nis found to be insignificant at 36 оп 32 degrees of freedom. A significant value of у? produced 

У а poor method of fitting only occurs once, in the case of count 76. 

: The fits by the preferred method to counts 5) and 66 for both distributions have been 
Siven in Table 75. The negative binomial distribution gave similar results for the other 
sect, Population counts. This table also illustrates a persistent feature for the counts as 
a Whole, which is that an observed cell frequency is usually either greater than, or less than, 
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the expected frequency for both types of distribution. Only very rarely is the observed 
frequency greater than one expectation and less than the other. 

The negative binomial distribution gives a satisfactory fit to Marshall’s first moth a 
count, given in Table 4. The values of T and U are both greater than twice their estimate 
standard errors for the Pélya-Aeppli distribution, although x? for the distribution fitted by 
Method 2 is only just greater than expectation. 


Table Тв. The negative binomial (N.B.) and Pélya-Aeppli (P.) distributions fitted 
to two insect population counts 


Count 5b Count 5b continued | Count 6b 
| > x "Ec 

| | | | 

r | Obs. | NB. р. | г |05. | хв. | к т Obs, | NB. | > 
m — | | E pen кене енен a 

0 | 190 | 1855 | 237-3 12 | 29 | 377 | 400 0 20 | 20-0 200 
1 | 264 | 277-3 | 2588 13 36 | 982 | 29-5 1 11 8-9 к 
2 | 304 | 3023 | 2758 14 19 | 209 | 21:5 2 2 | 107 91 
3 | 260 | 2886 | 2682 | 15 | 12 | 154 | 155 3 18 
4 | 294 | 9560 | 245-1 16 | 31 | ave | tis 4 08 
5 | 219 | 2167 | 2139 | 17 6 8-3 79 | 5 8 | 95 |1 
6 | 183 | 1776 | 1801 | 18 | 10 61 5-6 6 0:0 
7 | 150 | 1424 | 1473 | 19 | ; 7-10 в | 14 | 15% 
8 | 104 | 1116 | 117-6 20} o: p 16 66 | i118 8 | 74 a 
9 90 86-4 | 92-0 |оуег20) 11 . -8 | { 5 6-1 S 
10 60 662 | 707 | | 9» Pr ша || cB m 
п | 46 | 502 | 535 | yn | 203. | аз | м ve 

| ла 

1 | | | 


, Observation — expect? the 
: 5. Marshall mentions 0% 8. 
У be in error because of the minuteness of the 900 
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pointed out by Anscombe (1950), whoalso gave the criteria, Гапа U, of thetests of departure, 
with particular reference to the negative binomial distribution. These criteria have been 
described above in the first part of the section on the analysis of the data (p. 196). 

Some results for the Poisson limiting form of the three distributions are given after the 


following section on notation. 
2.2. Notation 


N is the total number of observations in the sample, 

т, is the number of observations equal to 7 (r > 0), 

ky, ko, kg are the first three k-statistics, 

Е is the probability of observing any non-negative integer 7, 

Ki Ku) denote the ith cumulant and the ith factorial cumulant, and are defined by 


оо p ți © kti 
haee SS, вазе ST. 

The estimates of a parameter by Methods 1 and 2 are distinguished by attaching an 
&ppropriate upper suffix, in brackets. For example, @® and 49 denote the sample estimates 
of the parameter a by Methods 1 and 2 respectively. It may be noted here that both 40) 
and 4% should be non-negative. If either of the estimates is found to be negative, the value 
Zero is arbitrarily assigned to it, implying that a Poisson distribution is being fitted. To 
distinguish the three distributions, negative binomial, Pólya-Aeppli and Neyman Type А, 
Suffix initials, for example @ з. 40, апа @ л., are used when this is required. 


2.3. Poisson limiting form of distribution 


All three distributions have mean m and variance m(1+a), and they all tend to the 
Poisson form as а> 0. The values of the variances of the various estimates and statistics 
hree distributions, and these are given below. The 


have the same limiting values for all t 
have been obtained by treating #10 — т, 400 — a, and 


large-sample variances and covariances 
49 — a as infinitesimals: 
N var (f) ет, N var (09) ~ 2, N var (09) ~ 4(e" — 1—m)/m?, 
N соу (0,0%) 0, соу (й,@®) 0, N соу (4049) ~ 2, 
Nvar(T)~6m%, Nvar(U)~4{e"—1—m— mi}. 


2-4, Negative binomial distribution 


t m 
P= Qrajt (5) (+r) jr r(? (r > 0), 


д 
т A80" кыл "ны, > | Т 
к, = Шр > as“, Кра = а(1 +а) да. к» Kin = (r 1)! ma, 


Estimati 41 
ion by Metho Anak, @= (6,1) — 1, 


N var (ii) = m(1+a), N var (a) w2(1+a)?+a(1+a) (2 + Зајјт, N cov (Ñ, a) ~a(1 +a). 


Estimation by Method 2 
m= k, dis given implicitly as 
Gin (1+4) = kalin бїт), 


N var (qo), 2009. (1 жалай +a)na—1 — име soma +а)}?, 


N соу (fà, 49) ~a(1 +a). 
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Let k,/In (№ о) = c, then 4% is the root of the equation 
f(a) = a—cln (14-a) = 0. 


A unique non-negative solution exists if ¢>1. An iterative formula based on Newton’s 
method of tangential iteration is 


ie © +а@)1ш(1+а,)—а,; 
{+1 jm а;—с ? 
where a; is monotonic decreasing towards 49 if aj is chosen so that f(a) is small and positive. 
А one-page table of a/In (1-- a) = c makes the task of finding 49 much easier. Table 8 gives 
values of ¥/a for log,)¢ = 0-05(0-01) 0-65. Linear interpolation may be used for logo ¢ > 0-20, 
giving values of a correct to four significant figures with a possible error of one unit in the 
last place. 


Table 8. 4/a in terms ој logge, where а is such that afln (1 +a) = с 


| | ы ~ | 
logy ¢ Ја A’ | A" | logigc За A’ | logc ға A’ | loge ўа А 
| a 

+ - + | | Ж = | 
0:05 | 0-6330 | 44, 0:20 | 1087 | ogy | 0-35 | 14778 |, 0-50 | 1:8445 | 251 | 
0:06 | 06770 | 403 | 37 | 0:21 | 11344 | 254 | 0-36 | 1-5019 es 0-51 | 1-8696 | 952 | 
007 |07113 | 37g | 27 | 0-22 | 11598 | 525 | 0-37 | 1-5260 | 241 | 0.52 | 1-8948 | 254 
0:08 | 0-7549 | 35g | 23 | 0-23 | 1-1850 | 252 | 0.38 | 15502 | 212 | 0.53 | 1-9202 | оба | 
0:09 | 0:7902 | 336 | 17 | 0-24 | 12100 | 250 | 0-39 | 1.5744 | 242 | 0-54 | 1-9456 | 256 | 
010 | 0.8238 | 555 | 14| 0:25 | 1.9348 8 | o40 | 1.5986 242 | 0.55 | 1:9712 | 257 
011 | 0-8560 | 310 | 12 | 0-26 | 1.2595 | 247 | 0-41 | 1.6229 | 243 | 0.56 | 1.9969 | 258 
012 | 08870 | 559 | 11 | 097 | 1-2840 | 245 | 0.42 | 1-6472 243 | 0.57 | 2.0227 | 260 
013 | 09169 | 202 | 7| 0-28 | 1-3084 | 244 | 0.43 | 1.6716 | 244 | обв | 20487 | 261 
0-14 | 09461 | 297 | 8] 0-29 | 1.3328 M 044 | 1-6960 | 244 | 0:59 | 2-0748 | 262 
015 | 09745 | 278 | 8| O30 | 13570 | 242 | 0-45 | 1.7206 | 246 | 0,60 | 2-1010 | 364 
016 | 1-0023 | 572 | 6| O31 | 1-3812 046 | 1-7452 | 246 | 0.61 | 21274 | 266 
017 | 10295 559 | 4| 0:32 | 14054 | 242 | бал | 1-7699 | 247 | (бә | 2.1540 | 207 
018 |L0503 55, | 4| 033 | 154206 | 242 | 0-48 | 1.7947 | 248 | оез | 9.1807 | 269 
0-19 | 1:0827 | 295 | 4| 0.34 | 1.4537 | 241 | o 10: | 248 | 063 А: 

260 1:4537 | 0-49 | 1.8195 `6 2.2076 | 970 
0-90 | 1-1087 -35 | 241 во | 294 | 255] 2 

| 0-35 | 14778 | 0-50 | 18445 | 250 | 0.65 | 2.2346 

In order to interpolate for subtabular values, ^ 


use Bessel's formula: 


, 01-0 
Л = fo+ 0a; - 100 (А). 
Except where given, second-order differences may be neglected. 


The exponent of the negative binomial distribution, defined by 


could have һе i | 2 
еы Ж — Sinan of the parameter а. The equation for the estimate 4? is simpl 

pra > the equivalent equation for £9 = д Jan, Te mas t ted that for the 
negative binomial distribu е у be пође 


tion both £45 cA ^ . 168 
with the fully efficient өскін к |09 and 159 are uncorrelated in large samp 


Tae af a combined ёар 


16 may happen that neit} m 


rde” 


ing the two estimates d? and a in 9 


a, = a  w(dtà д 


D. A. Evans 205 


The variance of d,, is a minimum for 
var (40) — cov (4949) 
var (40) — 2 соу (4049) + var (49) 


w 


The value of the minimum variance is 
var (6) = {w° var (49) — (1 —w var (@)}/(2w — 1), 
and for the negative binomial distribution, 
ата) [| _ m+a 


М ; (4). 49) ~ 
ушунан) m | а-(1-а)іһ(1--а)/ 


are given in Fig. 1. The bounding contours 6) = 90% 

oe 6, = 90 % given in this figure refer to the efficiencies of 4? and / as estimators of k. 
ey are redrawn from Fig. 1 of Anscombe (1950). The contour &,, = 90 % refers to the 

efficiency of the estimator 5, = 4,» and indicates the expected increase in efficiency. 


C У А "- . 
Ontours of this weighting function w 


for the negative binomial distribution. 


ction 10 


Fig. 1. Contours of the weighting fun 


If estimation of parameters is by Method 1, we may compute 
9 А 


T = + эл, 
s deviation from zero by 


and test the significance of it 
1 +a)? {2а(5а + 3) + 8m(1 + a)}. 


N var (T) ~ 2m(m + a) ( 
If estimation is by Method 2, we may com 
U = h(t), 
on from sero by 
(1 а) (1 +@)— (1+ За) 
N var (0) ~ тт ва) пайа) | 
(O +a)a* аа ие — (m 4- 1 +а)). 
+ гра) ш@ +а)- ај ! 


pute 


шц test the significance of it& deviati 
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Contours of the standard errors of T and U for У = 100 are given in Fig. 2. The line ЁЁ 
in Figs. 2, 3 and 4 represents the contour along which the efficiencies of Methods 1 and 2 for 
estimating the parameters of the distribution concerned are equal. Method 1 is to be 
preferred in the area to the right of this line, and Method 2 to the left. Logarithmic inter- 
polation is satisfactory. For sample sizes other than № = 100 the standard errors stated 
must be multiplied by 10//7. 


Fig. 2. Negative binomial distribution, Standard errors of 
. Т and U for N=100 
For other values of N, the standard error is to be multiplied by 10/yN . 
Illustration of use of Fig. 2 


Let us take the case of fittin 


i g a negative binomial t 3 th egg? 
given in Table 4, using Method 2 А аа 


. It is seen that 

> M@=hk=5-75, qe 5:07, 
The chart shows that the point (m 
that for N = 100, s.p. (U)is 
error of U for N 
above is 2-6, 


10/4N = 0-358. 


2-5. The Pólya- Aepyli distribution 


7—1 
В -ер[- |, B=, 4m v «al Р |ж 
2+a T 0 | aA pw (r>1), 
where | 


X = Кид +а) т. 
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r 
к, = т х a,(r) as, 
ja 


h 5 % 
pol ar) X (– у (2) ив 
" и=1 и 


| a(2+a) 0 
EET т(а|9у-і, қыт (ғаз ( 2 12) кы 


The probabilities Р. may conveniently be calculated from the recurrence relation 


E 2 4 
.= (2 ie У al, ,—o)3, 9 (722), 
Where l =rP, а=ај(2+а) and У = 4m/a(2+a). 
Alternatively, P = Роу у M(r+1,2,¥) (rz1) 


Where M (о, у, x) is the confluent hypergeometric function defined by 


x a+1)a° 
Ма, у, а) = 1425 48 = E 
i 71! y(y+1)2! 
A table of М(а, 2, a) fora = 1(1)4and fora = 0-00 by various increments up to 2 = 8:0, was 
given in the British Association Report for 1927, p. 233; and more recently Rushton (1951) 
as given a table for æ = 2(1)40. 
Values of the cumulative probability function can be found from the relation 


R R][2m)| = E 
ERR Za (gra) Вы Ret) dens 


where Z (p,q) is the incomplete -function. 
The following approximate relations for the cumul 


for checking purposes: 


ative sums of P. and of rP, were used 


t B-L —f(24- (Y — 2) R) a — 1) P; 
Pea- (24-1) Pn oy у pact ТЗВ | 
0 


2 


" G-a} 
d R-1 2 
And also RA рет B+ (R-1) Ра бај - RE ja 
0 0 


By Writing down the log-likelihood function L and equating the quantity 


aL . oL 
2m(1 +а) = 72 +) да 


to zero б btains the solution ћ = №. Because the probabilities P, are in the form of 

a Series, ту : inte possible to obtain convenient expressions for @ and its large-sample 
А sno 

Variance, 


Estimation by Method 1 


We have f dy, 69 = (hs) L 
N var (fi) = m(1 +a) N var (40) ~ 2(1 +a)? +а(1 +a) (2+a)/m, 
var (m) = Ы 


N cov (0,4%) ~ за(2 +a). 


208 Experimental evidence concerning contagious distributions in ecology 


Estimation by Method 2 


We have Ф. = 1, 49 = {2 а (М то} 2, 
2+a\? 2 у 
ЕСУ 2 4 a)? (е2лт\®+а) _ 1)— 4m), 
N var (49) e ) ((24-a)*( 


N cov (т, 49) ~ 1а(2 +а), N cov (40), a) ~ (2 +a)? (m 4- a)] (2m). 


Fig. 3. Pólya-Aeppli distributio: 
For other values of N, the st 


n. Standard errors of T and U for N — 100. 
andard error is to be multiplied by ШАРА 


If estimation of parameters is by Method 1, we m 
T = k, —k,(342— БУЙ 

and test the significance of its deviation from zero by 
N хат (Т) ~ 6(1 +a) m? + 9a(2 +a) 


m{(5a2+ 10а 4- 4)m +a(1 +a) (2 + a)}/4. 
If estimation is by Method 2 


; We may compute 


U = 1,1 (1440 
and test this by 2— l(1-4- d), 


3 (ај, oni 
N var (0) ~ 4 fe 1-1 - 201 а) (2+а)т+ (a2— 2) m. 


ors of T and U foy N 
the standard errors, 


) 6080 у 
; 2. — 4. »- = 100 are given in Fig. 3. This chart 0 ‚ Ө 
€ used to give rough values о as illustrated above in the case of Fig: 
A distribution 
a от ж 
P, = ета v J. (me-a\i 
Um ACD) (r2 0). 


2-6. The М. eyman Tyne 


| 
| 
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Beall (1940) suggests that for computation this expression is more conveniently put into 
а recurrent form. It may easily be verified that 
mee т а" 
Бат теі 2 ul B. (> 0). 


The following formula gives a means of checking the calculations: 


ari т А. 
В = EA where A = meja. 
rm E 


| It was also found useful to accumulate the sums of P, and of r£, which are monotonically 
i : : s f 
Nereasing towards 1 and m respectively. Approximate relations such as 


R R 
Nope (i BR) <m 
1 1 


are easy to derive. 
b Alternatively, by using a table o 
Y Molina (1947), we have that 


f the cumulative Poisson distribution, such as that given 


= © j 
©] Р-1-ееду P(c, aj) та), 
r=0 E j-0 J* 
Е © glai (aj i 
where Р(с,ај) = У c 
4=с Ы 
r Asor 8. E 
к, = т 2-4 Ка = ( ғаға) кы Қыз mar, 


By writing down the log-likelihood function L and equating the quantity 


oL oL 
m(1 +а) 59% 


It is not possible to obtain convenient expressions for 
use of the series form of the probabilities Р.. Shenton 
he efficiency of the method of moments. 


t А 

2" Zero, we get the solution m = ky. 

У oan its large-sample variance beca 
949) has given an upper bound to t 


imation by Method 1 


е have hak, = (4/14) — 1, 


), Nvar (020) ~ 2(1 +a)? + a(2 +a)/m, 


N var (№) = т(1+@ 
N cov (in, 4) ~ а. 


Estimation by Method 2 
We have @® as the solution of the following e 
1-6“. In (Мо) 
E бы 


quation in a: 


The iterative formula ашы = 0706 75 
Where с — Баћа (Мир), may be used. It is easily seen that a; is monotonic increasing or 
®creasing towards the solution 48 if a, is respectively less than or greater than 09: 
2 at р" т 
НИНЕ = а маи, 
N cov (ф, 09) ~a, № cov (40, аз)-а”е4-1- mj[m(Q +a) e7* — 1}. 
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If estimation of parameters is by Method 1, we may compute 
Т = k,—k, + b, — а, 
and test the significance of its deviation from zero by 
N var (T) ~ 6(1+a)? m? + a(18 + 50a + 25a? + 2a?) + 2a?(3 + a) т. 
If estimation is by Method 2, we may compute 
U = k,—k,(1+d), 


and Nvar(U)~2m*{(1+-a)°— (61) та ТТ ПЕСЕН јер“ 


where а = (1—e~*)/a and р = 1-(1--а)е-4, Contours of the standard errors of 7 and U for 
N = 100 are given in Fig. 4, which may be used as illustrated in the case of Fig. 2. 


Fig. 4. Neyman Т. 
For other valu 


ype A distribution. Stan 


es of N, the standard error is to be multiplied by 10//N. 


dard errors of T and U for N= 100. 


This subject of research Was suggested b 
pleasure in thanking for his helpful sugge. 
Mr D. A. East for preparing the іле 
tabulating the raw data given b 
Mrs E. H. Laurie for help in co 


ћ 
У Mr F. J. Anscombe, whom I have ш 
tions and criticisms. I would like to t5? 
. + x: i а 
figures for publication, and for the computing involved" 


y Beall (1938) anq М p 
: arshall (1 ы ot 
mputing the contour: VE ra peine: 
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The first draft of this work was completed during the tenure of a post-graduate research 
ae made by the Department of Scientific and Industrial Research. The investigations 
ading to the results given in Tables ТА and тв were made possible by a grant from King’s 


College Research Fund. 
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MISCELLANEA 
Time intervals between accidents—a note on Maguire, Pearson and Wynn’s paper 
By G. A. BARNARD, Imperial College, London 


Our object in this note is to point out the possibility of using Birnbaum’s recent (1952) tabulation of 
the distribution of Kolmogoroff's D-statistic in connexion with some of the problems discussed by 
Maguire, Pearson & Wynn (1952) in their paper on industrial accidents. In Table 1 of the latter рер 
we are given the time intervals between accidents involving more than ten men killed for the perioc 
6 December 1875 to 29 May 1951. The question arises whether the data are consistent with the assump- 
tion that the accident rate over the period has been uniform. A 
Let us begin by labelling i = 1,2,...,n the n (= 109) accidents considered, in a random manner 
(i.e. not, for example, according to their order of occurrence). Since the accident labelled ? is oqually 
likely to be any one of the accidents, it follows that if the accident rate is uniform, this accident m 
equally likely to occur anywhere in the time interval considered. In othor words, if x; denotes the time: 
measured from the start of the interval, at which the accident 4 occurred, т; is an observation пою 
a rectangular distribution with range from 0 to Т, the total length of the interval. And if the accidents 
are independent of one another, the x; are n independent observations from this rectangular distribution 


If we form from these n observations the sample cumulative distribution F i manner indicate 
by Birnbaum (1952), we have DID ied 


пР (2) = no. of accidents occurring at or before timo 2, 
while for the theoretical cumulative we have 


nF(t) =n.t/T, 
and so if D, denotes Kolmogoroff's Statistic, 
nD, =n Max |Р.) 
P feed М2 0] 


oes: (no. of accidents up to time t) = nt/T |. 
То estimate nD, from Maguire et al. Т | 
ү - Table 1, we notice tl imo i i excl 
tions, below expectation until the 38th. accident, which бол ta ete oe a rae 
point we have 105.19 days from tho beginnt 
| Pioa(4191) — F(4191)| = 38. (109 x 4191/26263) 
= 20:6 approxi 
) , ximatel, 
н D, is г 20. й ; 
а eis, ms eA, дв our data. Now according to Birnbaum tl А ton 1:0276 у" 15 
so that the 1% Р ti уе 80. We find 1 и мл oxime Е. 
h о point is ехсее xs 4 109 = 17-0, appY*^. ste?’ 
ат i ® Significant departure from the hypothesis (оі 
п 13 about 95, fact. Either the сво ае мир" jndep x 


(0) being a function of t. We то о 4 
. 8 b 


by the condition du = ch ot i ria ipo 
Thus the FEE. я A(t) dt, it is evident that the acr oer If we introduce a new ‘time’ V^ уде 
given above applies, with the result | зета will again bo uniform referro 
we 


wien (no. of accidents up to time t) — : т | 
1f, for example, 3 м eT. E | i 

" А es 
erm within the modulus sign becom 
d e- 262634) 


109. (1 еу 


is well within the 9 
5 0, 
Sponds to a Тода ће 


and we find that nD, 


k = 1/20000. This corre 


ke 
limit (p; ‚єс we t 
din 8 by Birnbaum аз 1-3581 /7) # MTS 

nt of about 1:8% per annum in the ? 
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ede талы data are consistent with the possibility of an exponential decrease in the accident rate. 
hing = a to any given probability level there will evidently be a highest and a lowest value of k 
dace bs consistent with the data, at this probability level. These two values of Ё will give a con- 

e interval for k, assuming the decrease to be exponential in form. Similar arguments will apply 


t А * 
9 any other parametric family of decremental curves. 
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Further notes on the analysis of accident data 
| By B. A. MAGUIRE, E. 8. PEARSON AND А. Н. A. WYNN 


1. Wo are very glad that Prof. Barnard (1953) has drawn attention in the preceding paper to the 


күш: the Kolmogoroff test may be used very simply to establish departure from randomness іп & 
Шады events occurring in sequence, either in time or space*. The figure reproduced below illustrates 
ie ы the application of the test to the accident data from our earlier paper (Maguire, 
ime e & Wynn, 1952) used by Barnard. Ift; measured in days from 6 December 1875, represents the 
Sari x occurrence of the ith accident following that which oceurred at t = 0, then the 110 cumulative 
Аы 5 Points (t; i), i = 0,1,..., 109, have been plotted (section A-B of the chart). A central, con- 
Шеш line joins the first and last points (0,0) and бат), where n = 109,4 = 7 = 26,263; this is the 
dist etical eumulative line. Two parallel ‘control’ lines have been drawn on each side of this line at 
ances (measured parallel to the axis of ?) of + 1:6276,/n = + 17:0 forming а significance belt. 
tates that if the accidents have occurred at random with constant 
hen the probability is approximately only 0-01 that the track of the 
e this belt. The points cross the upper limit of the belt where 
ОЛЕН, hown that when n> 100 there is good agreement as far out in 
tail as tho 1 96 point between the true and the limiting distributions of the Kolmogoroff statistic. 
We may assume that the approximation is also adequate at more extreme limits, it is possible to make 
© Of Smirnoff’s (1939, 1948) table of the limiting probability integral. This shows that 
nD, = maximum | nem — i |. 
16 0-001 % lovel. It ів quito clear, therefore, ая Barnard 
ficant departure from randomness, the average interval 


2 T he Kolmogoroff theorem s 
wi ims during the period T, then | 
i nave points (t; i) will pass outsid 

› t = 3931. Birnbaum (1952) has 5 


ex 


ШЕТ 


Push is 24-6 at i = 53, is also significant at u 
1 avin ы that there is evidence of very signi 
„18 Increased during the period. 
| ү Before following up one or two points conce 
9, it is perhaps desirable to say a few words abou 


rning methods of analysis suggested by Barnard’s 
t tho particular accident data of this example. The 


“Buros aro taken from Tablo 1 of our 1952 paper and show the intervals between mining accidents in 
Тоз Britain due to explosions involving more than ten men killed, during the 75-year period 1875-1950. 


hey wero introduced to illustrate certain points in our discussion—for example, the use of the following 
ll distribution of 109 intervals was consistent with sampling 


d tests of the hypothesis that the overa 
Ша single exponential population: | 4 
(а) the M-test (heterogeneity of variance) applied to single intervals, 

(5) Fisher’s g-test, based on the ratio of the longest interval to the average interval. 


Neither test established a significant departure from the exponential law. 


4. Alth қ hould have proceeded further, we did notin fact attempt to analyse 
these duo ле 9 Ыр RAE ine for changes in accident expectation within the 
erin at à 


ber е. we did not о that the accident intervals wore sh 
под. Th, 1 off test, namely, that the acciden: intervals were shorter 
. The tout by the Kolmogor Д | VAGO EUN 
од the ieee а А the period, can also be established quite simply by using another test 
4 hich we nos к. E В: ыы Dd 1952, p. 172) but did not use on these data. If T, is the time covered by the 
TS guir ә р 
^g accidents and 7, the time © 
T, = fs lo. 


* Some remarks of Bartlett (1949, 


overed by the next 54, then 
= 8042, T= ов — ба = 16864. 


p. 216) are also of interest in this connexion, 
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If the accidents have occurred randomly with constant expectation, TT, should. ra pun 
as а variance ratio, F, with degrees of freedom v, = v, = 108. But F = 16864/8042 = 2-10, 
significant at the 0:1 % level. | | | мг. 
5. Tostudy the changes that may have occurred in the risk of explosions in mines would + о : be 
consideration of a very complex story, and it would be out of place to make the attempt - а 
following summary, however, gives a broad picture of what seems to have occurred. There ар wed 
have been a substantial improvement in mine safety from 1870-90, and the average gravity of p Il 
as judged from the number of casualties per explosion, was less in the period 1920-50 than pe imei 
1890-1920, but the frequency of explosions was not reduced; on the other hand, there may have : А й 
some increase іп the risk of explosion in recent years in relation to coal output, the number of а 
employment and the number of mines being worked, but this has not been fully investigated by 188, 
authors. There has been а considerable reduction in mining accidents in some other important cat egor 


160 


Theoretical cumulative line 
and 1% limits for пр, 


For actual period AB 


120 For period CD 20 accidents | __ __ __ __ 
later in cycle 


=> 


Accidents 


12 14 16 18 50 28 30 
Days (thousands) 4 24 76 


( ) 2 вей 
а ve funct: й mple, under certain conditions 108 
cycle and look for a test for detecting chan a ihn arrange the sequence of intervals iP 5 ій 
point in the cycle. nges in A(t) which is independent of any particula" 
7. In the diagram we have intr fo 
‘oduced this id. 162 
the 109th. Tfno 118 idea. by ада уй nt^ 
Еве shat 2 =  Койтовогойғ test to the Strela pu Ја 2nd, 3rd, ..., ete., m acid jg 
0:597 limits ara iar S accident of the origina] Бан instead of AB, i.e. to the ae ра! " 
own by broken lines. The track of seh ee s Санай Ше ш К 
ive sample points now 
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outside the 99 % belt and significance would not be established by the test. The maximum value of 
| nt,/T — i | is now 13-3, which (using the limiting distribution of Smirnoff) falls near the 8 % significance 


level. 

8. Although the caleulation is somewhat laborious with n so large, it is of interest to consider the 
application of a further test which is allied to that of Kolmogoroff, namely, the w?-test developed by 
Cramér (1928), von Mises (1931) and Smirnoff (1936). If F(x) is the continuous cumulative frequency 
function of a random variable x specified by the hypothesis tested, and if 21, 25, ....%, ATE N observed 
values of x arranged in ascending order of magnitude, then 


1 12 2—1]: 
2 ЕТ, . 
o = Тп? 15 (не) 2n | а) 


For the сазе where the distribution of v = ¢ is rectangular in the interval (0,7), F(x;) = F(t) = ыт 


and we may write 1 12 . 

по = —— += 2 {nt,/T — (i — 0:8). (2) 

12% m^i-i 
maximum value of » times the difference t,/T — i[n, the 
w? test, uses the sum of squares of the n differences d; — tT- (2i — 1)/(2т). It will be seen that if 
the pattern of the n events is reproduced on а line of unit length, then d; is the amount by which 
the ith point is displaced from the corresponding point in the regular series 
1/(2n), 3/(2n), ... (2#— 1)/(2n), -.. (2n — 1)/(2n). 

of rarer occurrence, w? may increase 


Thus, whilo tho Kolmogoroff test uses the 


Thus if denser concentrations of points alternate with stretches 


Significantly above expectation. 
have been known for some time, but we are indebted to Mr В. A. М. 
в about the mean, Thus we have for the first four 


Moments: i МИ | 
1= 5, Ш 7 eon? 
= = f | " 
32n?—GIN+30 = 496n?— 15827: + 1671: – 630 | 
бут ie > 7560008 


normal. It will be found that when n > co the limiting values of the 
of the cumulants of the limiting distribution of nw? given by Anderson 
s ments in deriving an approximation to the distribution of no? in 
but the following values of the standard 


s distribution is very far fror 
oments agree with the values о 


Darling (1952). The use of these mo 


relati eS fuller consideration, 
а и samples а ет jid and у= аи, which we owe to Thomas, show that when 
tion and the moment та 1 = А 


be approaching the limiting form: 


"= 109 the distribution of nw? must 
n S.D. Ay Bs 
Ба РЕ 5:53 11-23 
10 ie 6-03 12:24 
20 0.148 6-33 12-87 
~ 0-149 6-43 13-08 
100 0:149 6-52 13-26 
шп 0:149 6-53 13-29 
© 
We shall therefi below the percentage points of the limiting distribution tabled by Anderson & 
М herefore use below ^ 
arling (1959, p. 203), in particular : "T s 
4% 1% % 
5% 0-499 0-743 1-168 
-461 
° $ of the Examination for the B.Sc. Special Degree 


аг 
of | The results were included in ап езаву presented as р 
he University of London (1952). 
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10. We have now calculated nw? from equation (2)* for the n = 109 accident times 2; considered 
previously, (a) starting from the zero pointat 6 December 1875 and (b) from the date of the 20th accident 
in the artificial cycle deseribed in para. 7. We then find for (a) and (b): 

(а) тог = 1-541, which is a value far beyond the 0-1 % significance level given above; 

(b) по? = 0-474, a result significant at the 5% (but not at the 4%) level. 


Tt follows that for the present data, in its original form, both the Kolmogoroff and w?-test give Very 
clear evidence of departure from randomness. When, however, the intervals are arranged in cyclical 
form and the analysis starts at the 20th accident, the former test does not, and the latter does, establish 
significance at the 5 % level. 


11. It would not be legitimate to make any general comparison of the two tests on the basis of these 
special results. One broad conclusion, however, may, we think, be drawn. As one of us has emphasized 
before (Pearson, 1942), while it seems attractive to transform a statistical problem into one of testing 
whether a sample has been drawn from a rectangular population, it is still not possible to determine the 
most efficient test to use for the purpose unless we can formulate the type of departure from the rectangle 
that is likely to arise. It appears that the Kolmogoroff test will only be powerful in detecting certain 
kinds of variation in A(t) and a similar position will hold for the w*-test. 
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On a method of estimating biological populations in the field 


By С. C. CRAIG, University of Michigan 


If it be assumed that the probability that a member of a biologi ; i ass b? 
found in any subregion of a well-defined region is proportional chee ies н ae дын ош» 
way to estimate the population of the entire region is to count the numb oat han random! 
chosen subregion. If the total population of the region is M, if the vo f amr d wa subregio” 
chosen to that of the whole region is p, and if n is the ntimbe К 5 54 —— =. * hen th? 
likelihood of the sample result is т found in the subregion, #79 


M 
(mecs 4-1-р 


which is maximized for the integer (or integers) м satisfying the inequalit 
цай У 


mo 15 й 5 2 (1) 
p 
Tn large samples the variance of this result is that of n/p which i 
ach 18 
d мї, (2) 
p 
* 'The sampling distributi 

within a fixed interval (0, 7), io. they asan p fall 


ћи 

i at the date of istributions for fixed T and have set T = 18 
neither Barnard nor the of the last accident, + A "апа т. We hav pu 
error is unimportant, present authors have quite kept to d he Easington disaster of 29 May 1951. T, 


he rules in making their calculations: 
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However, one f i i 
тое of my biologist colleagues has proposed + imati 
tho жа distances between р апа стен = par ee о о 
stochastic ron area and let the co-ordinates (x, y) of the position of cm p T aie = ы ОЕ 
observe the dista es each obeying a rectangular distribution law. An ава m Ж. 
to the co-ordinat ae to his nearest neighbours in each of the four quadrants dst = een 
should the total je axes drawn through his position. From a sample distributio = -— а a 
Sites x DAE be тты and how good an estimate сап be oe ici ad 
point; а X umed to obey a constant distributi i У 
For wh lnk ES pe from whieh to observe distances 6 ‘tenuate кенің p oe e 
C(0, a), and = E will take the region to be a square with its vertices at 00, Ри ee 
This avoids t} will consider only distances to individuals lying in the first трага 4 T » er 2 
the four itm s complication that for a choice of Q the distances to the ра = bina “У criem 
Ааа и Q will Јо be independent. Further, I assume that eno ои T "i 
ring, or at any rate is such that each new choice of Q gives an inde үкі 
pendently 


observed value of 6. 
€ 
B 
(0, a) Ші (a, а) 


о A ж 
(а, 0) 


То 
E s the probability density function (p.d.f.) for $ there are four cases to be considered. Setti 
tins a сече a—y =z, the four cases are: (1) = the smaller of w and 2; (2) д> the smaller ‘of w ме. 
For с he larger; (3) д> the larger of w and z but S y(w*+2*); (4) b> Aw? +24). 
Populatia ( 1) suppose first that Q lies in OAB. Then for any given Q for which now ô <w, if the total 
ion in the whole region is Л, the p.d.f. for ô is P 


aM 16% M-1 
да dd (650). 
( 2.) (55%) (3) 


the p.d.f. for all Q’s is 
ол М-1 2 
тмд Жы: (.-2 dà (850). 
On . За? 4а? а (4) 
summi 
g this over all 6%, from 0 to 4» the total 
2г(М +1) 1 4 там 
–— LS) , ЕТ ‚„М+1) = 
1- Fars) пар Мура s ) 


Sin, 
бё ыы: 
© this is independent of w, 


frequency for case (1) is 


РМ), 


rson. 


E ena 
hich Т. (р, а) is the Incomplete Beta-funetion of К. PA 
А ases, namely, those in which for a chosen Q іп OAB, w<dSzand 


or 
two of the three remaining © 
ifficult to find the p.d.f. for 8 for a fixed Q, but I have been able to express 
atures. For the fourth case, in which no member 


< 
5 
the S Vlw? +28), it is not di 
Over all random positions of Q only as quadr j 
is found in the first quadrant with Q as origin, the probability of such 


n e ау м in the region 18 
18 1--1 
а га шг\М dwdz 1 из 1 
E = ў SSP UM: 
( ) gd  M+li=1* 9 " 
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Now a sample of 6’s obtained for N random choices of Q will contain k, 076 less than or equal + eo 
distance from Q to the nearer of the right and upper boundaries, i.e. belonging to case UR 2 па 
belonging to case (2), Ка 6’s belonging to case (3), and k, instances in which по д was те s on 
attempt to use all the information in the sample requires that a rather complicated likelihood une = 
be maximized. However, the short table of values of P,(M) given shows that, if M is more than 50, d 
use of only the portion of the sample in which the д° are not greater than the nearer of the right ш 
the upper boundaries utilizes on the average some three-quarters or more of the total informat! 
available. 


M РИМ) M РИМ) 
10 0-5058 50 0-7442 
20 0-6216 100 0-8134 
30 0-6804 200 0-8652 


The likelihood function based on the k, observations in which 6 is not greater than the smaller of 
w and z for each Q chosen is, from (4), 


іі пмб, > М-і ° 
= fh Ж, йүү, ар. 


dei 2а 4a? a 


Maximizing log P with respect to M, I get as the maximum -likelihood estimate of M, 
A key тө? (6) 
М =—k, log (1— — ]. 
à т 2 к ( 4а ) 


A 
The asymptotic variance of M is 2 М? 


A rapid method for estimating the correlation coefficient from the 
range of the deviations about the reduced major axis 


By С.Н. LEIGH-DUGMORE, Dunlop Research Centre, Birmingham 


In many studies of organie correlation (in palaeontology, for example) it is useful to fit the allomerr " 
growth line of Huxley (1932) by the reduced major axis method of Kermack & Haldane (1950). | sis 
method has three advantages: first, it is more approp: Шы 


because the biological variability of the material is usu: 
measurement and because the term 


When the distribution of the vari i 
» lates, ie: А Я і 
scedastie and nearly normal на eS, ог, in studies of allometry, of their logarithms, 18 ite 
simply be made from a consi Aeration mI case), an estimate of the correlation coefficient nt 
reduced major axis. 6 range of deviations among the scatter of points abo 


dg US 
ack & Haldane (1950) let x, y be 0 ted 


Ma panahe ie EE T re their logarithms, X, Y, to base h to be normally distribu, 
SM ecimal are more conveni en 2 P i 
the reduced major axis of the X, Y correlation боксе у и ionem XT ар, 2x 
aldane line 
Y-F s 0) 
We will rewrite this in the form AK 8x 
1=9-E F4 E y, P 
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in whi is А өс 5 * У 
m hich y is the estimate of Y from (1) for any given value of X. This may, seem artificial, since the 

ermack & Haldane line is not intended for estimation in the sense that the regression line is used; 
7] 15 introduced for convenience in the following argument. 


In (2) Y-XX-log b, (3) 
X 
К 
pe (4) 


where а, b are estimates of the parameters a, В in the allometrie growth equation 
у = Вл“. (5) 
The sum of squares of the deviations of the Y-values from the line (2) is 


S(Y—5* = 5(У—Ў—-а(Х— X)? 


220-758(Y-Yy, (6) 
where r is the correlation coefficient of the sample of X, Y. Hence 
S(Y –2)° 
fel-gp-r "m 


Provides an estimate of the correlation, given the sums of squares of the variate Y about the 


ermack & Haldane line and about the mean value, Y. 
In practice, S(Y — Y? will already have been caleulated and used to find a. It remains to estimate 
=~ 3. 
When the distribution is reasonably homoscedastic and normal, an estimate of this sum of squares, 
Sufficiently reliable for many purposes, сап be obtained from the range of the sample of deviations 
the mean range in different -sized samples such as Table ХХП 


m Tables for Statisticians and Biometricians, Part ТІ. ws 
d, usually on logarithmic graph paper, the extreme 
imate of the correlation coefficient is easily calculated as 


" he following example. When a desk B 
doubtedly more speedy than calculation of the product-moment correlation. For machine calcula- 


lon 16 is necessary to look up the logarithms of the data; for the range method the data have to be 
Plotted. If a plot is necessary for other purposes the range method has the advantage in speed; if 
® plot is otherwise unnecessary it may still have the advantage depending on the sample size and the 


heness of any grouping of the data. 
У groups to the logarithmic data of the Micraster cor-anguinum 


ап ex: le tl hod can be applied 
D ee ча и by Kermack & Haldane (1950). 


If the logarithms of these data are plotted (to base 10) and the line (2) fitted (caleulated from the 
garithmic data) the greatest positive deviation from the line is 0-081 and the greatest negative 
87 0:206 and, since the mean range of samples of 


eviation — 0:195. Hence the range of the deviations is 
38 from a ај population is ct 5-83 times the standard deviation, we have 0-0354 as an estimate 


of the standard deviation of the deviations about the line. Substituting this and Kermack & Haldane’s 
Value for 83, (= s% when corrected to logarithms to base 10) in (7) we have a value of 7 = 0-8839 to 
==, 


Compare with 0: + 0:0116 given by Kermack & Haldane. 
"he vw readers SR ne examples by Dr D. Parkinson and the author and has been 


compared with the product-moment correlation. The agreement has been found satisfactory and the 


ifferences have not affected the results of tests of significance. 


Population, used as an illustrative exa 


REFERENCES 
London: Methuen. 
aie correlation and allometry. Biometrika, 37, 30. 


and phylogeny of the Lower Carboniferous Brachiopod Schizophoria 


Soc. no. 1484, p. 52. 


foxy, J. 8. (1932). Problems of Relative Growth. 
Kumwicr K. A. & Награхв, 7. В. 8. (1950). Orgar 
^RKINSON, D. (1952). Ontogeny 


resupinata. Abstr. Proc. Geol. 


990 А Miscellanea 


The effect of overlapping in bacterial counts of incubated colonies 
By C. MACK, Shirley Institute, Manchester 


INTRODUCTION 


ка и zer 
1. To estimate the number of bacteria present in a given atmosphere a sample of the air is F nak vh 
a plate of suitable material in such a manner that the bacteria are deposited thereon. The p тете 2554 
incubated and the bacteria multiply and form colonies. However, colonies whieh are close toge КӨСЕ? 
coalesce or one type of bacteria may inhibit the growth of another type by antibiotic effect во ja Gore 
count of colonies is an underestimate of the original number of bacteria present, and this pape over? 
cerned with the correction to be applied in such circumstances. Armitage (1949), in a paper eae does 
lapping in dust particle counting, mentions the similarity of the bacterial colony overlap pu ieee 
not deal with it in detail. However, there are some differences in the two problems, e.g. the colon ‘ vieni 
cover a substantial fraction of the total area of the plate, the dust particles usually do not; a dens ation, 
the dust particles are of small but finite size often of different shapes, while bacteria aro, before incu. 
good approximations to mathematical points. d. Thus 
There is a further difference between this paper and Armitage’s in the approach adopte il d be 
Armitage estimates by the first few terms of an expansion the number of ‘clumps’ which di sion 
formed, on the average, by particles placed at random over the plate, the convergence of the меуге the 
depending on the ratio of dust particle area to the total area. Though I have given formulae um. 
occurrence of ‘aggregates’ formed by points placed at random (Mack, 1948, 1949) this direct app ight- 
was rejected in favour of treating the problem as one of inverse probability. By this means stra Е не 
forward exact formulae for the ‘most probable’ number of bacteria and the posterior probability о 


has 
‘probable’ number of bacteria are obtained appropriate to each particular count. Lidwell {= 
given a formula for the most probable number, but the analysis is based on admittedly оуег-вип} 
assumptions, and his result is too low, 


DERIVATION oF THE FORMULAE 
2. Let g be the number of observe а», ...„ад the area of each of these, у 
c ыа of the plate upon whi Ow suppose m bacteria fell origin® у one 
ог simplicity, assume the ; ве m bacteria one must have fallen in б + 
in ag, and so on up to ay, whi ini fallen somewhere in the areas dys &w 1 ^" 
А . ere in the areas d 
Assuming each particle of such an occurrence is given by 


A being 


Fim) = "(т Па, (m — 2), (m— + 1a, са +. E au moa d 
A A UO | == DU > ) : 
If we use Bayes's postulate tha 


1 
ERU acteria, originally deposited is the smallest positive value о “ 
F(m+1) – (m+l1)r 4 
Fm) = m—q4i~ 3) 
(i “а п = [41 —)), | i 
e square bracket denoting the ‘integral 
4 1 gral part of’. Dividi i es Of 
i E : ividing Е 3 16 ђ 
i т-4 to % 15 unity, we find that the posterior rob "i (m) by a suitable factor so t! zm per 
acteria is given by Probability f(m) of m being the original 1 
fim) = 1) (m= аа Тењи] — руна (p 
4! 
* The areas іп which bacteria could hi ai 
the smaller values should thy =o Ра be som | 
ор» us be taken for q 9 А 


^ may also be derived by usi 


in, 
Е the Principle of maximum likelihood. 


а 
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A similar argument to the above shows that if the prior probability were proportional to (a) 1/m, 
(0) (т + 1), then the most probable number would be, in case (a) [(9 —7)/(1—7)], and in (5) [(g - 7)/(1 —7)]. 
Since q is usually of the order of 100 the differences are negligible in practice, so that the reasonableness 
of the inverse probability treatment is justified. 


3. If there are several different types of bacteria, then each type may be treated separately. Thus, if 
there are qı colonies of type 1, formulae (3) and (4) will hold for the most probable number of type 1 
bactoria and its posterior probability with q replaced by q, and where 7 is the ratio of the type 1 areas 
plus any area in which their growth might be inhibited to А. 


вз may have to be modified if, as in some kinds of apparatus, the plate 


4. The assumption of randomne: 
t a distance R from the centre of rotation have a 


rotates under a rectangular slit and points on it а 
Probability of receiving bacteria which varies as 1/R. All that is required, however, is that each of the 


areas ал, ...,@, shall be suitably modified (mult iplying by 1/8, where Fi, is the distance of the centre of 
gravity of the area from the centre of rotation, will be sufficient unless it is a large area). 
from (4) for quite a number of values of m before 


distribution. However, it is possible to represent 
1 distribution with a maximum at n and а standard 


E 1f qr is large it is necessary to caleulate f(m) 
obtaining a clear idea of this posterior probability 
Хт) approximately, if gr> 10, by a continuous norma 
deviation 

(а) (1—7). (5) 


For, replacing log (m!) by log I'(1 +m) and this, in turn, by its asymptotic expansion 
(m+4)logm—m-+ flog (27), 


We find that log f(m) =m log (mr) — (m — q) log (т — 9) +1108 [m/(m — q)] + constant, (6) 
dlogf(m) . Ж x Tu 1 
р log (mr) —log (т 9)+5 |? (7) 
diogf(m) [1 1 1 [ 1 1 ] 
OBIM | = -—— |“ 8 
dm? “Lm m-q. 2|(m—-g? m? (8) 
(7) shows that Дт) has a maximum when 
Ог this value of m, (8) shows that 
d*logf(m) =- о). 10 
ат 0 qr gr? (10) 
We now compare f(m) with a distribution 
g(m)ec exp {— [m-q r) + МІР. (11) 
derivatives of log (т) and log (т), provided higher derivatives 


5); 
(5) is now proved by equating the second t 
davo only a small effect. To estimate the magnitu! k 
erivative in the Taylor series of log (т) about т = Mo. 


4(1+7) 1 
pom cen [о] 


h I am indebted to the referee, may be obtained by using the 


de of this effect consider the term involving the third 
For m = m, + 2(gr)*/(1—7) we then find that 


r whic 


An alternati ге, f 
native normal curve, 10 ү 1 С 
bus that f(m) is the general term of the negative binomial 
1 T je 
b Zr l- 


Whi riance #(q+1)/(1 —r)?. These may be taken as the mean and 

Vas h has a mean (4:-7)/1-?) peii к t correction for continuity’ used (Yates, 1934). The 
i pos ® ficant На = 0(100) which is usually the case. 

lly graded by the size of their apparent areas. 


i to 

or overlap larger particles, or may overlap each other 
Е ount of small particles is an underestimate. To calculate 
cular grade of size, we find the area in which such 


= In counting dust particles, the particles are usua 
W small ; be ођве 
К er particles may 

orm apparently larger particles: thus the e 


Я i "ti 
P Correction to the number of particles @, say, in a pam 
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particles could be covered by, lie on the top of, or by overlapping form part of, larger particles. se ds 
the ratio of this area to the total area in which particles could fall. Then, by the same EE jr 
Section 2, equation (3) gives the most probable number of particles of the given grade. It is true t = Me 
does not tell us how many of the larger particles are really composed of several smaller ones and t ла! ; em 
count of these larger particles should be reduced. Mostly such combinations will consist of one B а 
overlapping one large particle, and if the small particle is subtracted from the apparent area the р жез 
will usually remain in the same grade. The reduction due to this effect is thus, in general, unimpor ме 
In any case the theoretical results worked out in this paper should enable experimenters to decide У 

concentration of dust particles (or bacterial colonies) is permissible to achieve a desired reliability- 
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Non-normality in two-sample t-tests 


Вур. С. О. GRONOW, Institute of Aviation Medicine 


1. In a recent note (1951) by the present author the bias and power of two-sample t-tests Wee 
discussed. The notation in the present pay 


5 à с, ane 
à t L pres per 18 somewhat different to that employed before, 
accordingly the previous discussion is sum: 


к " les 
c iod marized here using the now notation. Given two вар. 
v(i = 1,2,...,n) and aj (j = 1,2, ..., n'), two criteria, 


и 2 
Lu mme HL ———— = 
(eis m= l)p па and v= 
nn’ n-4n'—2 ] ) 
where zal Еш и К 
ші а 
" i а l n 
8° = — - 2 a. J 
wae т), 8 a oer X (ај 3): 
were considered under the assumption that the i mally 
e 5 5 : Же populations generatin thes rere both norm 
distributed, but might have different variances. The method used to 5 T Pa que SRM | power was 
obtain approximations (in series form) to the moments of u апа F e Mtn ваши putio™ 
of these criteria, by frequency curves | OF SO, i 


| о 
rst four moments, and to find the propo! + ove? | 


vy" | 
ow a ава е te ine аташы) т in the calculation of the moments was 11 the 
assumption that the samples had come from ano ES M а <a ы. дене ; dil 
another method was used. n-normal population would be prohibitive. Ассо? 
2. David & Johnson (1951) have poj 
pointed out th: ~ 
variables is found usually in order to be able to iaka о кт ы a © any P 1 | 
atement of the for 
here L t iii, dur 
Where Lig 15 the value of the abscissa i À ” 
р of the dist ec 
$e in the tail. This Probability statement is the 5 =" the ordinate of which cuts off a proP? 
5 
E (2 — 1, 
: #)>0} = 4 
and, following Cramér, the i Е - d 
: » әлеу pointed out that istribus; С 
as that of the ratio, provided 9 is ney > 2” distribution teI des oo m" 
expressed in terms (E ее ег negative, It will be ја у could equally we poth 
atisties as functions such аз noted that the шынын жон 


МЕ, — ki) (e, ча), 
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ы Н b are different for u and for v, except where n = n’, when w=v. Thus we may make the 
P (| be — №) (а) | > Lya} = до, 

Ор P {6300 — ky)? ГА (ko + akg) > 0} = о. 

J = бб ki)? — Та (о +ak3) 

сап be written down under very general conditions. If б іѕ the difference between the population means, 


4 if the dashed cumulants relate to the second population, then the first two moments of this quantity 


The moments of 


6 KS Р 
E(J) = ve ©) — 18 (Ks - akt), 


, 
Ka K: 


бұл) = bit + 6 + =) І? (к + «кә | 
+ [> (3 = х) TR (е = =) | 
п" п °, n n 


K № к. К%\? 
жо 4. а += 
n? n^? n т 
3015, 5 etes rac (+) | 
г = n 


n 
2 кї 
магы. ТЕЕ) == 
Ы + 


The third and fourth moments follow similarly. 


З. Tho moments calculated in 52 are quite general, the only assumption that is made about each 
Population being that its cumulants exist up to any required order. It is possible, however, to simplify 
fin, expressions for these calculated moments somewhat if we make a further assumption regarding the 
тең сопа] form of the population sampled. In previous work it has been found that asymmetry in the 
B a ulation generating the samples has little effect on sampling distributions, but that kurtosis may 
ó *r nominal significance lovels appreciably. We shall assume, therefore, that the parent populations 

an investigation may be described by two symmetrical Gram-Charlier Type A series. It will follow that 


„~ == VAS E 
ку = Ky = Ke = Ко = 0 апа к; = — 35а 


*esulting in considerable simplification of the moments of J. We have that 


М) = 919+ ei 5) — Dis (ra + ак), 


” 
Ka Ка 
и) = wef +5) 


, ; 
Ky Ka ата (Ка | 954 а (Ка ака 
ңа) - n Lm Lt 


пЗ m^? 
Ky, кеја а ( 8 ак? 
ЕЛ БЕ) +214, ne e О 


“ith Similar, longer expressions for из and да. 
ments of J it is necessary to approximate to its distribution by assuming 
] отта, the parameters of which have been found from the known moments. 
$ the momental constants f, and fj, would indicate a Pearson 
e curves are, however, intractable to handle arithmetically 
v) were used instead. Using both tails of the ¢-distribution, 
Sonina lte a, ано due to kurtosis, in two specific cases, is shown in Tables 
se. and 1 (5). TH эль =: E + of non ity is seen to be noticeable for both the u and v criteria, but it 
Cems to Sikes ne өкшесі foni le sizes are unequal. This suggests that if the 
р "ulations may be ne а aS we qual variances, the advantage pointed out by Welch 

е non- 4 
WE using v rather than u when comparmg the mean 


Or iu aving obtained the mo 
eli 18 distribution afunctiona 
Typo У calculations showed tha 
ange LV авап approximate graduation. Тћез 


johnson (1949) ‘unbounded’ type curves (5 
-tail test, the bias 
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Table 1 (a). Probability, as a percentage, that u (= v) falls beyond the 5 % points of the 
t-distribution with 18 degrees of freedom. Case n = т = 10,6 = 0 


(Symmetrical Gram-Charlier populations, / = //;.) 


9 
К/К 1 2 3 
Ра 
2-5 5:33 5:58 5-92 
Normal 5:00 5:13 5:30 
40 5-51 5-74 6:13 
„50 5-94 6-29 7.31 


Table 1 (b). Probability, аз a percentage, that u and v fall beyond the 5 % points of the 
t-distribution with 18 degrees of freedom. Case n = 15, n’ = 5, д = 0 


(Symmetrical Gram-Charlier populations, 2, = //;). 


Criterion Kalka i H 1 2 3 
Bo 

“ 2-5 1-56 9-45 5.28 10:34 16-49 
Normal 1:40 2-95 5-00 9-71 13:20 

4-0 1-65 2.58 5-91 10-73 14:59 

50 1-83 2-81 6-55 11-14 14:93 

v 2-5 5-83 7:14 9-12 11:65 12-74 
Normal 5-47 5-85 6-83 7-99 8:67 

40 6-37 6-89 8-76 11-04 12-01 

5-0 7-50 9-11 11:37 13-32 15:23 


Table 2. Effect of non-normality in parent populati ДАЙ 


; ons on the power of the u and v tests to detect а diffe 
6 between population m i ОД и ; 5 ‚ Саво 
ien d um E ват (using the 5 % significance points for t with 18 degrees of freedom) 
Power of u (= v) test. 
wr 2 B 
, У(к + кд) 4 
кук» 0 0-5 1-0 15 20 25 3:0 
Ра = 
1 25 0:053 | o E | sai 
ee 0.050 | 0196 | 0590 | 0-893 | она | огоот | 1000 
40 0-055 0:562 | 0.892 | 0.987 | 0-998 | 1%00 
50 0.059 | 2205 | 0494 | 0-887 | 0.80 | 0-997 | 1000 
216 | 0587 | 0-883 | 0.978 | 0-997 he 
2 25 0-056 i 
р 0-197 i .000 
Normal 0-051 | 0.188 s 0-894 | 0-984 | 0-998 1000 
50 0057 | 0-207 | оез 0:892 | 0-988 | 0-999 1.000 
0 0-063 | 0.999 0-889 0-980 0-998 .000 
0-581 | 0.88 É 1 
9 | 0977 | 0-998 p- 
3 2-5 
0-059 0- 0 
IM 0-053 | 0250 0594 | 0889 | 0-986 | 0-998 то 
EA 0-061 | o212 | 0559 | 0893 | 0-987 | 0999 | 1:000 
| | 0:224 | 0.590 ES 0-980 | 0-998 | 1.000 
| 0-978 | 0-998 ы 
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Te varying the value of 8, the distance between the population means, we may study the effect of 
es a in f; as well as in the variance ratio к/к» on the power of the tests, that is to say on the prob- 
the У of establishing significance. For the case of equal sample sizes, with n ='n’ = 10. Table 2 shows 
power of the test for inereasing values of the ratio 
0 = 6/3 (ка кз), 


Umen не to be the appropriate non-central parameter touse. Remembering the differences between 
ы T ym significance levels, i.e. between the figures in the column headed @ = 0, it will be seen that 
Ені E of equal samples, the power function is remarkably little modified either by inequality in 
aaa v variance or departure from normality. This result suggests that provided the two samples 
door or nearly so (as they generally will be in any planned experiment), the standard normal theory 
ag age for the t-test. may be used, even when the variances are unequal and Ра differs from 3. 
in y OP example, in assessing roughly how large samples must be to establish a given difference 
Population means using the t-test, Pearson & Hartley's (1951, p. 115) chart for», = 1 could be used with 
ф = 18 јпјо, v= n+n'—2, 
where the value taken for g? would be a rough estimate of the average population variance, (к-к), 
р on past experience. 
ne the sample sizes aro so unequal th: 
lei value, it becomes difficult to int 
Wen em have been made for the case 7 
is ho d have been determined for the same values of f. : 
Conte to make use of these results in a further communica | 
than н levels given in Table 1 (b) above, the effect of non-equalit 
аб of kurtosis. 


1 that the effective significance levels differ substantially from 
erpret the resulting modifications in the power function. 
2 = 15, п’ = 5 and tables of the power function for both 
a as in Table 2 and for к{/к = $, $, 1, 2 and 3. It 
tion, but it may be remarked that, as for the 
y of variance is far more important 


My thanks are due to Dr F. N. David and Professor Е. 5. Pearson whose suggestions helped con- 


81 а 
iderably in the preparation of this paper. 
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Note on the Poisson Index of Dispersion 


By М. KATHIRG AMATAMBY, University College, London 
and the validity of assuming that it is dis- 
m equal to the num vations minus one, has been discussed 
Jusion that the assumption of the y? distribution 
servations provided the Poisson parameter was large, but his method 
in that it was ison of moment ratios, these moments 
An alternative ap der the conditional distribution 
у Т, of the observed vi 
ob determined and hence the exact 
tain, rdi the Poisson law. 

ed by allowi to vary according to tne ‹ | е 
од of te аи ~ urther C culations have recently Е ed = by Lancaster (1952). Neither 
: : i 5 more than four observations. We 5 jer selves hi 
with ur Wisi eec пу E em i.e. with the overall rather than the conditional distribution 
9f the Tad. ae whee T may a Э rh of the rewriting ofa probability statement to calculate exact 
Oments Uk Thi pube ніш о obtain probability integrals which may be compared with those 
he з, а зев by 522 <> ments in а general form the power of the Index е Dispersion test can 

? Obtained т, by writing © tive hypothesis is that the variance 18 greater than the mean. 

‚ when the alternati = 
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n Index of Dispersion, 
nber of obser 


as 
not altogether satisfactory 


th 

оре selves being infinite series. 

e е Index for a fixed total, ва 
Very possible partition can be 
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2. It is assumed that there аге N random independent variables £y, Tg, ...,%y which may take any 
zero or positive integral values and each of which has the same distribution. If this distribution 1 
Poisson then the Index of Dispersion is written as 


N 
д (а,- 2)? 
1- < 


= , 


2 
where 2 is the mean, and the probability statement connected with the test of significance takes the form 
PU> Хула) = = 

N 
or Р(( > аав) o] = а. 
i=1 
If k, and k denote the first two of Fisher’s k-statistics this expression becomes 


PUN — 1) - ky 334,2) > 0} = а. 
Accordingly we focus attention on S-(N-1)b - hx up 
mla 


the moments of which can be written down exactly. Assume, for the time being, that the cumulants E 
the parent population are ку, ку, Ks, .... We note that both k, and к, can never be negative. Easy algebr 


gives 
é(S) = (N- 1) ка — Ку Xi- 
ка, 9кі к, к, 
A) = Way | e | 20-1) ne + thn, 


5 N-1» Kg како 407 – 2) а 8x3 
#45) = ( 1) м N(N-1) N(N-1s ^ * те 


4 C ~ 
-3(N-1y ERES tO. ] +3071) Жаа и ~ 4 


NW-1 
Ki 24k. Kk 32(N —2 l a 
B ou TR ) | S(&ANI-9N 46 
КВ (= 1) Ne ММП Миа aye it NAN 1 Б 
144к,к2 BS m) 48kj 
NN- NN IP шыс] 
Ик, к, 2 
= 407—1) x. A g ышы i 12k,kg 8(N —2) 24ка 
Воља] уз MN- NQN 1 + N(N pest NON —1* 
+6(N – 1)2 ЧЁ È NUN _ 4% 
t-te № МТ) 
K 
до ИУ 
When the parent Population is Poisson 
and the moments of § reduce to Do на "m 
48) = AN-1—4 ај, 
HS) = [9s A XN — 1) 2 1 
N Жаа Fia | emer n, 
а Y=? (М1 
нів) = | © aav n 307-1) 
1 
№ № Ха- а + № Xu | 
aaf S nan -5 1200-1), 
N - 


A N EM ВАМ — 1) 
к(8) = — IN — 14 4 
Ns UN S Tja гы... 
к. жаа — Та ay адаа] 
+721807 — 1) rays 
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Жо ^ Having obtained the moments it is necessary to decide on a suitable functional form with which 
ae рош etothe distribution. Preliminary calculations showed that the р, and f, of the distribution 
lay in the Type IV area of the Pearson system. While it is natural to fit a Pearson curve, since so 
и occurred naturally in sampling theory, the Type ТУ is intractable to handle. Moreover recent 
m agations have demonstrated the closeness of the probability integrals of the Type IV and the 
И nson S, curves with the same first four moments, and the latter system is considerably easier to 
ur pum In the main therefore either the Johnson 5, or the log-normal curves were used to estimate 
i PRU integral from the moments. In two cases Pearson Type IV's were fitted as a check on 
eet ohnson system. and in nine cases Edgeworth's form of the Gram-Charlier series. The difference 
mole tho probability integrals estimated by all three curves is not large. The results are given in 
^ e 1. These estimated probabilities should be equal to a value of а = 0:05 if the ү? distribution is 
Satisfactory approximation to the true distribution of the Index of Dispersion. The interdependence 


Table 1. Estimated probability integrals corresponding to a nominal ү? level of 0-05 


X À 1 5 10 
N S, or S, Gr.-Ch. | Туре IV | S, or 5; | Gr.-Ch. S, or S; | Gr.-Ch. 
— % | 
201 0 051 0-052 0-052 0-050 0-050 0-050 0-050 
101 0-051 0-052 0-051 0-050 0 050 = s. 
51 0-051 — == 0-050 — 0-050 ==> 
п 0-052 = = 0-048 — 0-049 = 
6 0-055 0-050 == 0-048 0-053 0-048 0-052 


of N and A will be noticed from the table. For either N large or А large, or both large, as Hoel noticed, 
the ye distribution is a reasonably good approximation, and in fact, there appears to be little risk of error 


or N as small as 6 provided A is greater than or equal to unity. Generally the larger the sample size the 
Smaller tho acceptable value of A, but this rule must not be pushed to extremes. Г, in practice, is 
s 'Scontinuous and may only take а finite set of values for given A and №, a set which will decrease 
rapidly with decreasing À. In fact for À small discontinuities are more likely to be a source of error 


t А P i. gt 
han assuming the distribution of I is x". х - 
по further interesting point is suggested by these results. For any given sample size it appears that 
Or А small the first kind of error is greater than that allowed for by the x? distribution. This in judging 
Significance from уз tables we shall tend to overestimate significance for А small and a low nominal 
8 ; a 
'Énificanco level should therefore be chosen. 


Table 2. Power of test when the alternative hypothesis is а compound Poisson. 
0:05) 


(Nominal significance level = 


кку=% кујк =2 


Type of series 


N 


0-669 0-956 0-960 — 


"Thomas { = = = 
; 0-666 | 0-669 
Neyman 0665 | 0659 | 0666 | 0-898 | 0-949 | 0955 
Neg. binomial | 


0-888 0-890 
Thomas 0-865 0-887 0-890 0-999 0-999 — 
Neyman 50824 | 05880 | 0886 | 0-988 | 0997 | 0-999 
Neg. binomial 


H 
[7] 
à 
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vA i istribution in 
4. The Index of Dispersion is commonly used to detect departures from cie si ше T can Bd 
the shape of the variance being larger than the mean. There are many population e i “eis gan Be 
chosen to describe this state of affairs; we have ed ourselves to such m ay 10 ud ihren 
described by a two-parameter compound Poisson series. For the sake of example w € inte dE 
types of such a distribution, the Thomas, the Neyman, and the negative binomial. T ae ji difforent 
these series can be chosen so that they all have the same mean and variance, but they wil iv ri 
higher cumulants. Substitution of the eumulants of any one of these in the general nra 0) and 
moments of S enables us to obtain an approximate distribution from which we can find Р. {S> аппа 
hence the power of the index of dispersion test with regard to an alternative of the selected comp: 
Poisson type. The results of such calculations are given in Table 11. А Жет) 
The power of the test appears to depend but little on the value of A (or кү) but increases, as is ехре 


$ А ost with 
with increase in sample size and with the value of k,/«,. The differences in the power of the test У 
regard to the three alternative series are not large. 
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On an extension of Geary's theorem 
4 Ву R. С. ГАНА, Statistical Laboratory, Calcutta 


А ignce: 
R.C. Geary ( 1936) has shown that if the sample mean is distributed independently of the sample Мајо 
^ then the population Should be normal. He assumed that the moments of all order in the pop“ ation 
should exist. Later on, E. Lukacs (1942) proved the same theorem, assuming only that the popu 
should possess a finite variance. ;puted 
| Basu & Laha (1952) extended Geary’s theorem by proving that if the sample mean is distrib tion 
independently of any sample k-statistic (as defined by В. A. Fisher), say k, (rz 2), then the рор“ d 
is normal. 
Here another extension of Geary's theorem is given. ite 
5 . à 1 
THEOREM. Letz,2,...,2, be identically distributed, independent random variables with ? © 
variance о?, ivon 
If now the conditional expectation of any unbiased quadratic estimate of cg? (c+0) for #' 
у ++... тн does not involve the latter, then each 2 should be normally distributed. 
А n 
бу, Proof. Lt Q= У 


ИР 024%; be any unbiased quadratic estimate of cg? (c4 
'Then, 


E 0). 
: from the definition of unbiasedness, we have 
f£. OP dinh 
or 
Г в? д Ne DES = с0°, where m= E(x), 
ог S auc: " 
м} 2 974-0 and У ay=0. | 
= „j=1 
{ Быш from the condition of the theorem, it follows се that 
140 ейл} = E(Q) Е(ей2г) a 9 
= 2 n 
where ¢(t) = E(e't7) stands for thi Em 
1 (ей) s е characteristic ћ i 
ce 2 - c function of the di 
РШ уш the left. | Y of (1-2), it can ђе aeei о pm to the for? 3) 
= Y d 2 ү 
Let E (за |) à aab (4/9) am в 
95 us now write Y(t) = log diz 
т ete an g ka ), с Y(t) represents the cumulant-generating function dei 4) 
ау 2j; ^ 
di [$ ana РИ d пр | 
. а di? $— "di ф E 
t 


pum Áo 


4 
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Substituting (1-4) in (1:3) we have 
әу n ата + 
Е id > + ( У" У ад = – со?. (1-5) 
E аё үшү јел 
Now, from the relations in (1:1), (1:5) at once gives 
а 
Esc 16 
d (1:6) 


that is, Y(t) isa polynomial of degree 2, which shows evidently that x should be normally distributed. 
Tt is interesting to note here that we need not assumo the complete stochastic independence of Xx and ©. 


From this theorem and Craig's theorem (1943), it follows evidently that if the sum 24--2%--... HUn is 
" ad n В 5. B 
distributed independently of any quadratic form Х аӊаа, satisfying the relations 


" ї,ј=1 ги 
(i) Уацт0 and (ii) X а=0 
" = ij 
Өй ЕА = 0, 


where А із the matrix of the quadratic form, and e = (1, Тен)» 
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The Doolittle method and the fitting of polynomials to weighted data 


By P. G. GUEST, University of Sydney, Australia 


A Modification of the standard Doolittle scheme of sol \ 
of orthogonal polynomials and standard errors has recently | в ruest, 


9 correlate and extend their treatments. 
Y he least-squares curve и (а) of degree p whi 
m the iti 3 
condition that У wa) а) uale) 
i=1 
Should be a minimum. We consider the case in which 
Polynomials Тв) of degree j іп а, j taking values fro’ 


p 
ult) = A by fi); m y 


ich fits the n observations (ау), of weight w(x;), is obtained 


it is desired to express v, (v) as a linear function of 
m 0 to p. Then $ 


Where the b,; are given by the normal equations 


У (го) Ye) – > by fled} Seles) = 0 (k= 0to р). (2) 
If we Write ` i 
E w(t) (а) o) = M» (3) 
Sule) fio) fie) = фл E (4) 
then the normal equations become К iii 


р 
> bys Pie = М». 
j=0 


% қ 
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i zi i thod 
The quantities фу. form a symmetric matrix, and the standard Doolittle or abbreviated Doolittle me 
28 B 
lution (Dwyer, 1941) may be employed. " "m . 
E aa ма orthogonal polynomials 7',(x), of degree j in x, satisfying the equations 


6 
Хақ) ТХ) Тих) = 0 (Jk). w 


r ; tor is 
These equations determine the orthogonal polynomials except for an arbitrary factor. This facto 
conveniently chosen so that the leading coefficient of Т (2) and f;(z) are identical. Thus 


1-1 (7) 
Туа) = f(x) + ти 

here, using (6), 8) 

Бл Шаа ag = — Уша) fj) Та) [7 wv?) ТЩ). | 
i i 9) 
Writing Sy, = D w(x) f(x)T (ж), к 

® = — 8,18 
апа Sy. = Хи) fj.) Uic) HAr, r- T's) +... + Tof) 
T (11) 
= irt Ж бум. 
т=0 


The quantities S;,, 0, can thus be built u 
values Sims Arm (m <). 


Tf u,(«) is expanded in terms of the polynomials T(x), 


» ted 
p in succession from the given фу, and the previously calcula! 


р (12) 
u,(z) = AM а,Ту(ш), 
j=0 
then the least-squares condition leads to the equations 3) 
; a 
а = MjIS;;, 
where М; = ХаҚа))у(а)) Т) 
ісі (19 
=M;+ У ам. 
k=0 ға 
The author has pointed out that the quantities S;,, а; a 


Р wi 
ity М}, a; are obtained directly in the по er's 
he normal equations. For example, in ЮРУ 
i.12..4-1) Further, if coefficients Бар are introduced such 


section of the standard Doolittle method of solving tk 
notation, S, = Фила. ы-у апа М; = М 


i 15) 
T(x) = „2 ыма), "t 
d 6) 
then Bir Og E o is P 
m=k+1 


and the Й;, can be built u 


p in the Doolitt 
The coefficients 5 y. 


table along with the a,,. 
> Can be written as * "n 


к 17) 
была > Gaby | 
ізі ) 
or 5 ү 
by = + Ў Ва | 
| | ізі р 
Equation (17) is used in the backwards sectio 


sion in terms of the Вы has 
degree 7 (<p) аге obtained Ъ 
quantities / in the estimatio; 


an 
n of the standard Doolittle solution. However, the sl a 
b, in the least-squares polynom y tho 
xiz in (18) for which k>r. The use 


cribed in the earlier paper. pt? 


е) 
4 om! 
wer moments or reduced factorial ™ 
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When reduced f i = Е: 
actorial s ar (С ich i i 
rial moments are used, фр; = Xw(x) (5) (9 , which is not directly calculable by 


repeated summati rev 4 
mation. However, фу; сап be expanded as a linear function of the Xw(z) (5) . In fact 


E (Ку (j+k—m қ 
"ES " x 
fa (») ( k ) 296 [52 al | (19) 


Th the m=0 
method of i ri г Fis ion (5) i i 
equation (19). calculation deseribed by Fisher, equation (5) is used, with the фу; calculated from 
Лике Vis : 
аъ es howev er, uses for the functions f;(x) the unreduced factorials 20 = 2(2:— 1) ($—3 T1) 
y linear combination converts the equations (3) into the equivalent set | 
p 
à. bp; Dwl) (2, + Б)#+® = E w(x) y(t) (2: + 8). (20) 
The ауа = Е : 

и Ie and moments occurring in these equations can be obtained directly by repeated summation. 
бало matrix of the coefficients of „у is now non-symmetrie, and the Doolittle method of solution 
ed ie ap used. For this reason Fisher's method of obtaining the normal equations is to be preferred, 

ese equations should be solved by the Doolittle technique. 
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A simple method of deriving best critical regions similar to the sample space in 
composite hypotheses 


tests of an important class of c 


By K. 5. RAO,* Department of Statistics, University of Bombay 
developed by Neyman & Pearson in a series of 


hypotheses, 
n mathematical statistics at various 


са Introduction. The theory of testing of I | 
5559 ` commencing from 1928, is an integral part of the syllabus 1 ; 
oe In teaching the subject, the author of the present paper found that it was possible to 
эп the rather formal step which Neyman & Pearson (1933) took in their paper ‘On the problem of 
Most efficient tests of statistical hypotheses +, in finding the test statistic to be used for a uniformly 


о 3 . à : 
St powerful test of a composite hypothesis regarding а normal universe. By using a simple lemma 


Bivi дар 
en below, a great deal of formal algebra of the original paper can be saved. 
ient condition that the variates X and Y may be independently 


2. 
dr Je lemma. The necessary and зас: ! А \ 
Muted as y??.variates, with m and n as their respective degrees of freedom, is that the variates 
(1) 


and 
V defined by nX = +, 


ely, as ап F-variate with т and n degrees of freedom and a 
the sum of two independent y?c?-variates 


ar 
ther words, 


ei . 
Хг pendently distributed, respectivi 
is jp) Stiate with m+n degrees of freedom. In o 


9pendent of their ratio. 


10 Proof} is easy and therefore omi of this lemma is that, if Ху, ..., X, are in- 


tted. An extension 
then the 7 statistics 


Че 
Pendent X202-variates with Ту -Mr degrees of freedom, 
т-1 
E У т, X 
mX, таз” Жы. ы 8 аы BU eg (2) 
АЯ Ek. „ = 
т» Хз тз Х,+Ха = ж, 


5-1 
ates and а y2a?-variate and the converse. Multivariate 


а 
given by К. 8. Rao (1951). 


To, 
ext si Deotively, т-1 indepen 
ns of the above result an 
» Nuffield Foundation 
+ It will be found in а n 


dent F-vari 
d their applications are 
Fellow, 1952-3. 


Dominion т. 
books on statistics. 


umber of text- 
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“ 


і a is 

3. Illustration of the method. The extent of simplification that results by using the -— d 

таа by applying it to Example 10 of Neyman & Pearson (1933, рр. 328-32). The а кн Ul 

d applicable to similar situations. In what follows, the numbers given in bold type а = Тр 
ae of an equation refer to the number of the equation in Neyman & Pearson’s paper. 

iti i i implificati ide. 

dure, it is hoped, will bring out clearly the simplification ma Ў Р К К. 

Phe test is that for the significance of the difference between two variances. Neyman & Pe 
consider that two samples, 


(1) E,ofsizez, mean =, standard deviation = 8), 
(2) У, of size nz, mean = z, standard deviation = s;, 


Н 424 " robability 
have been drawn at random from some normal populations. If this is so, the most general probabi 
law for the observed event may be written 


ЖШ др: 5 (ж, —а;)* +81 % (29 — ау) 
Жы ыы ы = (йы түт, РЕ > 201 | 20% cond 
where n; + п, = N, and а, с; are the mean and standard deviation of the first, and dy, съ of the ве 
sampled population. | , aed 
The admissible simple hypotheses include pairs of sampled populations for which ај, аз, 0, 7 9 927 . 


т ө t qum Lim r signl- 
may have any values whatever. Hj is the composite hypothesis that с; = с,. This is a test fo 


be 
Я ч s "s тра; 
ficance of the difference between two variances in two independent samples. The parameters 1 y 
defined as follows: 


For a simple alternative H,: 


а) — Ий = Ql “o= (4,160) 
HP = а, а =а-—а =, aP = 0р, of = | = боја. 
For the hypothesis to be tested, Но: m 
00 =а, g9-—b, 09 = 0, a) = 1. (5, 


А „деј the 
After showing that the conditions required by their theory are satisfied, they proceed to constru 
best critical region similar to the sample space. 


Tn the process they obtain the surfaces 


(6, 171) 
9, = 5, = const., (1 172) 
go = 2, = const., d 
1 2 2 (8, p 
фа = jy (M181 Hnes?) = 8$ = const. 
The element ($4, фу, фа) is the part of W(¢,, фр, фз) within which 9 174) 
BUE E, ть, 84) ро, P 
the value of k being determined for each system of values of 2, 25, Sa SO that “ 175) 
10, 
Pow, 9» $з)) = eP(W(d,, $» ф)). | 
The best critical ге, 


gion similar to the sam: 
with respect to фу, Øy, ф, 


that, for 
(а) 0 = соја > 1: 


m 


tained by integrating the clement бе буо 
values. By using condition (174) they 


ple space is ob 
з Over their ranges of possible 


the ‘best critical region" (в.с.в.) will be defined by 1 178) 
на осі p 1 
"s 83> Кит, Ba, 52), | 


(b) 0 = соја, <1: the B.C.R. will be defined by 


79) 
2,1 
815 es (0,2, 55) б jion! 
After devoting the whole of M 5 Кш 
в da foll we: © of p. 331 and part of P. 332 to formal algebra, they obtain the bes 
4) 
(a) For alternatives 03» 0,: T€ __ _ á (1% 19 
1481 - n, ^ ч 5) 
(b) For alternatives 03< 0: аре f 4 (1% ш 
E UR E С 
148i та ~ 0 
However, the 1 асат * 
Dividing E Усы al the transition from (178) to (194) very simple 88 pr E 
2.02. will be defined by У ihe Positive quantity, (madi ла), = Nsi[n, in ease 9 a? 
78% 
и = 


а а 
781 + тай Essa, 62), 
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The left-hand side of the inequality is a function of the ratio of two independent y*c*-variates, 7, 51 
and 83 which are independent of 7, and Z,. From the lemma it follows that the left-hand side is statistic- 
ally independent of %,, % and 82. Hence any percentage point of ће statistic u on an intersection surface 
Біуеп by (171), (172) and (173), is the same for all such surfaces and hence the в.с.к. is given by (194). 
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REVIEWS 


D 5.15. 
Econometrics. By GERHARD TrwTNER. New York: John Wiley and Sons. 1952. $5.75 


Quantitative economies has three main branches. First there is the expression of the laws of wee 
behaviour in a form which permits of empirical application. Secondly, there is the problem, : Ke 
great practical difficulty, of extracting from the observed workings of the economic system t gie 
by which the magnitudes entering into these laws may be estimated. Finally, the estimated magni ‘ning 
must be applied to the theoretical laws in order to calculate parameters and test hypotheses сее 
them. Strictly speaking, all three branches have their place in econometrics, but the usage of t ле Снан 
has tended to become restricted to the last of the three; it is with this branch of the subject 

Dr Tintner’s book is wholly concerned. : gial 

Earlier books on econometrics have dealt mainly with the economic background or with p the 
parts of the field, and this is the first attempt that has been made at a systematic exposition is 
Statistical methodology of the subject. The book is a compendium of known procedures rather dhe 
а research monograph, and thus will be of great value to the practical worker although contar 
little that will be new to the mathematical statistician. 

Part I gives a non-technical introduction an 
numerical illustrations of the application of the 
part of the book, with its lavish supply of references, 
survey. One confusing feature is the treatment of li 
of the linear relation У = %+PX. Presumably on 
holds exactly. However, we have next ‘the empiri 
not hold exactly’. But we are not told 
cannot be the sample estimators, 
are assumed to be independent; о 
are random variables. It would surel 


of 


ће 


а ate difference method, does not seem ver сова 

dite es Ый ерту of the special problems that arise Owing to the fact that most eoon rion 
of time series, i 5 

SE frei boe be aie Particularly helpful is the stress laid on the analysis of reg" pE 


probability, frequency distributi : М the book 
sample and sequential ; 


ч : Қ ват су 
в variables, Р; t n defective, and Part III is on single it 
Statistical methods likel SOY describes the t eory of 5 i у ае? В. of 
b У of control charts, and Part 18 
variance, regression and » includin, 
algebra are contained in 
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of symbols and special technical terms. There are over 300 exercises for the reader, many of them 
Containing sets of data from the American literature on quality control. The book as a whole is copiously 
illustrated with worked examples, tables, diagrams and nomograms. 

Part II on acceptance sampling for proportion defective suffers from the failure to emphasize the 
fundamental distinction between rectifying and non-rectifying inspection and from not stressing the 
Tole of cost in determining the amount and type of sampling. The result is a clear account of the main 
types of scheme but, apart from one short section, no indication is given of which type should be used 
In given circumstances. A whole chapter in this Part is devoted to a detailed description of the sampling 
Schemes used by the U.S. Department of Defence. 

Part IV on control charts follows the usual American practice of setting control limits three standard 

$viations from the mean even when sampling binomial and Poisson distributions. An unusual feature 
05 the treatment is that the operating characteristics of the charts are always given. The account of 

Compressed-limit gauges’ (p. 296) is rather misleading, the important paper of W. L. Stevens (J. R. 

tatist. Soc, B, 10 (1948), 54) having been overlooked. The use of control charts to help to improve 
Processes во that they become ‘in control’ is clearly explained but the inexperienced reader might get 

© erroneous idea that the state of ‘control’ is easily attained. 

' Part V includes, besides the conventional statistical methods, some recent developments not normally 
"Ppearing in text-books. Examples are the use of short-cut methods of analysis of variance based on 
Tange, J. W. Tukey's gap and straggler tests for comparing individual means in analysis of variance, and 
some interesting work by F. E. Grubbs on comparing the precision of different observers or instruments 

Tom results arranged in a two-way table. 4 қ : А 
di Surprising omission is that there is no adequate account of how to obtain samples. There is a brief 
'Scussion of the general principles of sampling, but the example given is highly simplified. Some 
scription of special methods of sampling powders, liquids, fibres, etc., would have been very valuable. 
‘ccording to the author's preface, the book is intended both as а text-book for engineering and 

"Siness students and as a reference book for quality control engineers and industrial research workers. 
“ngineering О aunt of we. The охо short course on statistics, are 
ely to find this very detailed and lengthy account of use. The excellent collection of exercises will, 
LoWever, be very valuable to anyone planning a course of lectures on statistics for engineers, and the 


Sok can be strongly recommended оп this account. Because it is reasonably comprehensive, the book 


58 i i i tance sampling. 
п also be recommended to technologists using quality control and acceptan: pling a 
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CORRIGENDA 


(1) TABLES OF PERCENTAGE POINTS OF THE 'STUDENTIZED' RANGE 
(Biometrika (1952), 39, 192) 


E : ig "this 
I very much regret to have to report inaccuracies in the lines у = 5 and 7 of the revised tables of ipe 
function. These lines were obtained, at the last stage of the computational procedure, by interpola iat 
between neighbouring values of и. Through an oversight on my part the convergence of the іп! 


А ге 
polation formula was not checked, nor were the four percentage points of g for n = 2, = 5, 7 compat 
with the corresponding percentage points of ,/2/,% 


Table of the upper percentage points of the ‘Studentized’ range, 4 


E S u — — 
v 5% points 196 points v 596 points | 1% points 
| | | _ = ——— 
n в 5 7 5 7 |% 5 7 Bw 7 
| | | E | 8:55 
| | и | 1747 | 630 10-48 ~ 
2 | 364 3-34 570 | 495 12 7-32 ea | 1070 | ЁТ 
3 | 460 | 416 | 697 592 | 13 7-47 655 | 1089 | 8% | 
E p E2 | #68 | з | ве | x4 | лю | 666 | iros | 90 
5 5-67 5-06 8-42 7-01 15 | 7-72 6-76 11:24 on 
6 6:03 5:36 8-91 7.37 te | 7-83 6-85 11-40 9-24 
7 6-33 5-61 9-32 7-68 17 | 7-93 6-94 11:55 a 
8 6:58 5:82 9-67 7-94 18 8-03 7.02 11-68 ен 
2 5:80 6-00 9-97 8:17 19 8-19 7-09 11-81 HH | 
10 6-99 6-16 10-24 8:37 20 8-21 TOT 11-93 909 | 
1 Е о 


- gerpolatio® 
ecessary to develop a completely different interpol& Е 
4 18 almost proportional to МЕ (n.— 1, v), where P(n- for 
| ДВЕ 
with range w in q = w/s, and (ii) У uF in 


In faet, for an 


Е Y given n, the ratio P 
easily obtained puo drin 


d 
Є --1 21 5 . coul 
by interpolation for the interr Diz is almost constant for all values of у and 


Further, this property 
' tables. Only for the lines у = 5» og О! 

» and the corrected values for these line hen 
of other values will, however, be pe M 
coming Biometrika Tables for Statistic’ pee 

я. 0. HAR 
(2) 2. 


Draven, Biometry 
? *a (195 
Tn the last column of the table half- (1952), 39, 299. 


“ау down the page, for 2.753 read 2-750. 
* We are indebted to 


: Mr George w. og out 
attention to the di : Бе W. Thomson of wing 
iscrepancies between the values of и Corporation, Michigan, for dra 

< tor v=5 and 7. 


Votumn 40, Parts 3 AND 4 « DECEMBER 1953 
E 


THE POPULATION FREQUENCIES OF SPECIES AND THE 
ESTIMATION OF POPULATION PARAMETERS 


By I. J. GOOD 


A random sample is drawn from a population of animals of various species. (The theory may also be 
applied to studies of literary vocabulary, for example.) If a particular species is represented r times 
in the sample of size N, then r/N is not a good estimate of the population frequency, p, when r is small. 
Methods are given for estimating p, assuming virtually nothing about the underlying population. 
The estimates are expressed in terms of smoothed values of the numbers n, (r— 1, 2, 3, .:.), where n, 
is the number of distinct species that are each represented r times in the sample. (n, may be described 
as ‘the frequency of the frequency r’.) Turing is acknowledged for the most interesting formula in this 
Part of the work. An estimate of the proportion of the population represented by the species occurring 
її the sample is an immediate corollary. Estimates are made of measures of heterogeneity of the 


Population, including Yule’s ‘characteristic’ апа Shannon’s* entropy’. Methods are then discussed that 
do depend on assumptions about the underlying population. It is here that most work has been done 
othesis can give a good fit to the numbers n, but can give 


by other writers. It is pointed out that а hyp Sat don 
quite the wrong value for Yule’s characteristic. An example of this is Fisher’s fit to some data of 


Williams’s on Macrolepidoptera. 


arandom sample to be drawn from an infinite population of 


Ls ls ‚ А z 
Introduction. We imagine 
and let n, distinct species be each 


ani 2 
s ds of various species. Let the sample size be V 
Presented exactly r times in the sample, so that 


© 
dS т, = М. (1) 
r=1 
whe Sample tells us the values of 7, По; ..., but not of nọ. In fact it is not quite essential that 
о Should be finite though we shall find it convenient to suppose that it is. 


Ve shall suggest a method of estimating, among other things, 
% the population frequency of each species; — | 
(ti) the total population frequency of all species represented in the sample, or, as we may 


wi ‘the proportion of the population represented by (the species occurring in) the sample’; 
i t ameters measuring heterogeneity, including ‘entropy "n 


ameters defined without reference to any special form 
ation of parameters for hypotheses of special 


Ш) Various general population par 
Seneral’ parameters we mean par ; 
ye d othesis. In $7 we shall consider the estim 
s, | 
о 
» " results are applicable, for example, to 
neness and of chess openings, but for defini 


orr т ә H me, together with an intuitive d. nstration. by 
ef ti tł intuitive demonstration, 
Dr mula (2) was first suggested to me, tog 


у $ of the credit for the present 

: М. Turi ral rs ago. Hence a very large par Е i 
Pape per o him for allowing me to publish this 
Won. should be given to him, and I am most grateful to hi g p 


of 
fo 
to studies of literary vocabulary, of accident 
teness we formulate the theory in terms of 


in Reasonably precise conditions under which our general results are applicable will be given 
1 M, but we state at once that the larger is n, the more applicable the results. When n is 
NBS, i oles for the most part attempt to estimate it. There 
Wi о Will also be large, but we shall not for 
ill be а fleeti oe + the estima he end of $5 and a few more references 
7 37 апа 8 agri Je, equation populations of known finite size, the 
- (See, for example, equ? p 
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tion of n, at t 
(73).) For 
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problem has been considered by Goodman (1949). He proved that if the sample size de 
less than the maximum number of individuals in the population belonging to a i к. 
species, then there is only one unbiased estimate of ж and he found it. He also pointe к. 
that the unbiased estimate is liable to be unreasonable and suggested some alterna г + 
estimates that are always reasonable. There is practically no overlapping between 
present work and that of Goodman. а 
_ Jeffreys (1948, 53:23) has discussed what is superficially the same problem as (i) ab of 
under the heading ‘multiple sampling’. p refers to some earlier work of Johnson (1992): 


) ч imself 
_ The metho Johnson and Jeffreys depend on assumptions that, as Jeffreys himse 


points out, are not always acceptable. Moreover, P Pr are not intended to be 
applicable when n, is unknown. The matter is taken again in $2. | 
"Oder ne MAP А of species has been mainly concerned with the fitting d 
particular distributions to the data, with or without a theoretical explanation of why inne 
distributions might be expected to be suitable. See, for example, Anscombe (1950), Cham m 
& Y ule (194 rbet, Fisher & Williams (1943), Greenwood & Yule (1920), Newbold oe 
Presto Ae (1944) and Zipf (1932). The methods of the first six sections of th 
resent paper are largely independent of the distri 188 
We shall be largely concerned with 4» the population frequency of an arbitrary spec! 


that is represented r times in the sample. We shall use the notation 6 (q,) for the ехресіе 


5 LJ ж” 8 
value of g,, in a sense to be explained in $2. Our main result, expressed rather loosely, 1 
that the expected value of q, is r*/N, where | - 


4 - ж. р 
4, ужа (74 TY, р е 


(The symbol ‘=? is used thr ughout to mean ‘is approximately equal to’.) More precisely 
the n,'s shou d first be smoothed before applying formula (2). Smoothing is briefly и 
in §3 with examples in 58, If the smoothed values are denoted by 71, n2, nj, ..., then 


more accurate form of equation (2) е. 1 
ч B ue CULA ; ч ; 
й бы) 252 аға % e 


The reader will find it instru 
form se-"ar[r! Then 7% reduces 
The formula (2) 


butions of population frequencies. 


* n В 15800. 
n sider the special case when n/ is of the Pols 
stant, ' 


can be generalized to give higher mome Mh " In fact 
~ < р 
ват) C a ; 


% | (8) 
= ‚Зит = 0,1,2,..), 
where 699 = 16-1)... (E- m+ 1), We cai Зуіп the form % 
Ји А 4) 
| дек fin. 
Moreover, the variance of 9. is ' à | 
Ғараб 1) g К. rin 
А ај LA $) 
FE) (Edu) жал. C у 
An immediate deduction from (2) і 3 É 
that Sa 4 } j 
represented т times (т > 1) in the oos dati d Po ei al species Ж 
А Я 
6) 
ла. > + ' 
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Qum also the expected total chance of all species that are represented r times or more in 
€ sample is approximately 


N-U(r+1) па + (7+ 2) nz g +... (7) 
In particular, the expected total chance of all species represented at all in the sample is 
approximately $ 
N-(2n, + 3ng +...) = 1- /N. (8) 
p may say that the proportion of the population represented by the sample i proxi- 
ately 1—n,/N, and the chance that the next animal sampled will belong to am ecies is 
“pproximately ІМ 4 ) 
| т]. - " (9 
(Thus (6) is true even ifr = 0.) * ' “ 4% yn Ж. 
4 The results 6), (7), (8), and (9) are improved in accuracy by writing the active 
ormulae as + ғ $ т " 
#. е @ Шыл. ' k |. (8) 
+ »d Co 
, ye път, ү (n) nantes ү «Je га ¥ ү: 
É T Tri , 
* 5 е; ` 
NET ты... 7. Же) 
"ыла а 
+ . ~ 
and ' Жк: . 17% / (9^) 
.. AB + 


ғ ? 
wn most applications this last expression will be е emely чо л тА. з Ка in its turn 
dm Often be very close to ту У. It follows that (8 and (9) are practically t ne as ( 8) 
a (9). For the sake of mathematical consistency е smoothing should be such that (8°) 

nd (9') add up to 1: 6: $ n 1» 
An index of notations used in a fi d sense i: given in 
am grateful, and my reader: ill also be g 


larify some obscurities, especially in $$ 1 and 2 " 
2, be s, which we suppose is finite. 


Proofs, Let the number of species in the popu ion 
This is the same supposi i 4 no is finite. Our results as far as 56 would be practically 
“changed if s were enumerably infinite, b. proofs are more rigorous when it is finite. 


Le : ; 
the population frequencies of the species Ж in some order, р, Ps; «++» Ps» Where 


жұрт 
UE бр more explicitly М(р Pas N^ the statistical hypothesis asserting that 

рц. р, are th 3 lation frequencies. { П discuss the expectation of n,, given H. | 
eias SEN He a f simply the observed number n, whatever 


Tt 
may be obj cpectatio 
thei Septem hae be ogically correct. Strictly we should introduce 


Informati i jection wou 
ds fend um variable that is the frequency of the frequency r in 


59. № 4 
Prof. M. G. Kendall for forcing me 


tog 


++... = 8. 


Та notati random 
à random 5 ALD forthe e notation &(v,, у | H) for the expecta- 
Чоп ce 74 his expectation would remain unaffected if particular 


Values In order to avoid the extra notation у, у We shall 


te & р instead of &(v, | Н ). Confusion can be avoided by 
"аф К «(а Вог бт, =. were ‘ov of the frequency r when H is given 
СА |Н) as ‘the expectation of the frequency of the лед! y À ы 


"ж: 
И 
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and when the sample size is №’. Similarly, we write V(n,) = V(n, | Н) = Vy(n, | H) for the 

variance of v, y given H and & (n? | H), etc., for &(v, & | Н). . Т 
We recall the theorem that an expectation of a sum is the sum of the expectations. a 

follows that @ y(n, | H) is the sum over all s species of the probabilities that each will occu 


r times, given H. So Ey(n,| Н) = &(n,| Н) = &(n,) 


s (№ = (10) 
= 5 (а-қ б 
Lo 8 (11) 
In particular Ey(%|H) = У (1—p,). 
pen 


If 8 were infinite this series would diverge. The divergence would be appropriate since ty 
would also be infinite. 

Now suppose that in a sample of size Ма particular species occurs r times (r = 0, L, > zl И 
We shall consider the final (posterior) probability that this species is the jth one (of рор те 
tion frequency p,). For the sake of rigour it is necessary to define more precisely up 
species is selected for consideration. We shall suppose that it is sampled 'at random ; he 
rather equiprobably, from the s species, and that then its number of occurrences 11 3 
sample is counted. Thus the initial (prior) probability that the species is the jth one 18 1 А 
If the species is the jth one then the likelihood that the observed number of occurren® 
is ris N 

(za - 2. 


Wewriteq, for the (unknown) population frequency of an arbitrary species thatis represented 
r times in the sample. The final probability that the species is the иһ one can be written 
Р(а, = p, | H) provided that the Руиз are unequal, (If any of the p,sare equal they 027 
adjusted microscopically so as to be made unequal. These adjustments will have по practi? g 
effect.) We may at once deduce the final probability that the species is the pth one by ane 
Bayes’s theorem in the form that the final probabilities are proportional to the initial o” 


times the likelihoods, We find that 
4 "A 12) 
Р (q, = p,| H) = POT pat j 


У p,(1— p) 
It follows that for any positive integer 77, "A 


5 тт (1.9) YN—r Фа 
егін О РК e 


P ы” si 
РА 1 = Руј“ 
= (7 - mem би (па | Н) e 
in view of (10) (туя 2 (а, Н) ° 19 
in view of (10 and of (10) with 47 
inus ) N replaced by y +m. Immediate consequences of (14 


5) 
“(а | H) = 1+1 бул (па |н ü 
ч "1 Ap @ = 0,1;2,...) 


M 6) 
Bis Vig, | Ну = уи, у б 
les о) 
ШАТ 2 
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I (r 4-1)! A 
Where Крах = (МИ! By lst | H) (17) 
1 LET Зр A 
= Woy и ОРА" = i Out Н)( | Н) (18) 


by (10) and (14). Tt is clear from either form of (18) that the numbers др y (t = 0, 1,2, ...) 
form a sequence of moment constants and therefore satisfy Liapounoff’s inequality. (See, 
for example, Good (1950a), or Uspensky (1937).) This checks that the right side of (16) is 
Positive, as it should be being a variance. [It is obvious incidentally that (16) would be 
true with іу defined as 6 (qt | H) times any expression independent of 4.1 

We can now approximate the formulae (14) and (15) by replacing бу (Прат | Н) by the 
observed value, n in the sample of size V, or rather by the smoothed value ж. If 


9 rm? 
Ж i$ very small compared with N, if n, and 7,4. are not too small and if the sequence 


PT, Mg, ... ls smoothed in the neighbourhood of п, and n,,,,, then we may expect the 
*pproximations to be good. We thus obtain all the approximate results of §1. Note that 
When the approximation is made of replacing Ay asus | Н) by та we naturally also 
change the notation &(g? | H) to £l"). For the results become roughly independent of 
unless the n,’s are too small to smooth. Observe that &(g? | H) does not depend on the 
Sample, unless Н is itself determined by using the sample. On the other hand, &(q;") does 
` Pend on the sample. This may seem а little paradoxical and the following explanation 
М Perhaps worth giving. When we select a particular sequence of smoothed values 
%, na na, ... we are virtually accepting a particular hypothesis H, say HN; па, ћу, ns, ...), 
mith curly brackets. (I do not think that this hypothesis is usually a simple statistical 
YPothesis.) Then & (qi) can be regarded as a shorthand for &(q}" | ALN; Ny, 005, Та, e) (If 
‘J is not a simple statistical hypothesis this last expression could in theory be given 

à definite value by assuming a definite distribution of probabilities of the simple statistical 
"Уробһенев of which И is a disjunction.) When we regard the smoothing as reasonably 
reliable we are virtually taking H(N; Mis Та» ng, ...} for granted, as an approximation, so that 
5 can be omitted from the notation without serious risk of confusion. In order to remind 
Ourselves that there is a logical question that is obscured by the notation, we may describe 
(47) as say a ‘credential expectation’. ж . 
fa Specific H is accepted it is clearly not necessary to use the approximations since 
equation (13) can then be used directly. Similarly, if H is assumed to be selected from 
Süperpopulation, with an assigned probability density, then again it is theoretically 
зве to disport With the approximations. In fact if the ‘point ( Pry Pay =>» ps) is assumed 
2 be Selected from the ‘simplex’ p CES = ЈЕ with probability density proportional 
Tip, ... p,)'-1, where k is a constant, then it is possible to deduce Johnson's estimate 

1 = ("+ >) [CN + ks). Jeffreys’s estimate is the special case k = 1, when the probability 
ensity ig uniform, Jeffreys suggests conditions for the applicability of his estimate, but 
ese conditions in not valid for our problem in general. This is clear if only because we do 


assume j 
‚ Jeffreys E sabe иш that all ordered partitions of N into s non-negative parts are 
itia] о le Johnson assumes that the probability that the next 


ly equally probable, whi 
ti ual sampled will be ofa 
an Mes that that species is already repres 


information that can be obtained fro 


particular species depends only on № and on the number 
ented in the sample. Clearly both methods ignore 
m the entire set of frequencies of all species. 


lvi 
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% a i S 
The ignored information is considerable when it is reasonable to smooth the frequencie 
of the frequencies. 


З. Smoothing. The purpose of smoothing the sequence t, по, ng, ... and replacing it =. 
а new sequence 7, na, ng, ..., 18 to be able to make sensible use of the exact results (14) F 
(15). Ignoring the discrepancy between Су and бу» the best value of n; would be &y(%; | у 
where Н is true. One method of smoothing would be to assume that H = H(p,, D» s и 
belongs to some particular set of possible H’s, to determine one of these, say Hy, by шшщ 
likelihood. and then to calculate т; as Gy(n,|Hy). This method is closely related to tha’ 
Fisher in Corbet et al. (1943). Since one of our aims is to sug 


: рпа 
gest methods which are virtually 
distribution-free, it would be most satisfactory to с 


атту out the above method using i 
possible H’s as the set from which to determine Hy. Unfortunately, this theoretic? у 
satisfying method leads to а mathematical problem that I have not solved. «ties 
It is worth noticing that the sequence {éy(n, | H)} (r = 0,1,2,...) has some prope! E: 
invariant with respect to changes in H. Ideally the sequence (n;) should be forced to ЕГ у 
these invariant properties. In particular the sequence {им = 0,1, 2, ...), Черле 50 
(17), is a sequence of moment constants. But if ¢ = o(4 N), then N-(r +)! Шала Hr Ө, 5 
that if = о(/ М) we can assume that the sequence 7! ^; is а sequence of moment cone 
and satisfies Liapounoff's inequalities. But this simply implies that 0%,1% 2%, mue 
an increasing sequence (see equation (2^)), a result which is intuitively obvious even with ow 
the restriction ¢ = olJ N). (Indeed, the argument could be reversed in order to obtain a ?' 


proof of Liapounoff's inequality.) We also intuitively require that OF T& 2%... should its? 
be a ‘smooth’ sequence. 

Since the sequence (шағам (t = 0, 12. 
ability distribution it follow: 
increasing’ 


„ођ“ 

+.) is а sequence of moment constants abe Maly 

$ from Hardy (1949, $11-8) that the sequence is ‘tO tely 
› ie. that all its finite differences are non-negative. This result is unfortun? per 
too weak to be useful for our purposes, but it may be possible to make use of some И 


е ; the 
theorems concerning moment constants, This line of research will поб be pursued м 
present paper. 


A natural principle to adopt when smoothing is that 


p= у (tim)? a? 


Should not be significant with ға 
formula for V(n,| Н) уне 


| ‚ applicable when 72 = 
эщ when 72 = o(N). [See formulae (22 


56 
jm 

edom. In §5 we shall obtain an appin be 

o(N). The chi-squared test will awn (80) 


) (25), (26) and, for particular Н'8, (65 


ore 
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y 
а о О. 
For some applications vary little em ing Vee hod, wii и er pics 
E e у ; shir g la e required, while for others it may be 
4 se quite elaborate methods. For example, we could 

E the n,’s for the range of values of r that interests us, holding in mind the 
theres oo test and the rule concerning vr. The smoothing techniques may include 
he н o reehand curves. Rather than working directly with 24, na, ng, ... it may be found 
E to work with the cumulative sums 724,74 + По Ny + по + a, ... ог with the 
рз ка he sums of the rn, or with the logarithms log 74, log na, log 73, .... There is much to 
de А working with the numbers т, у па йа... For if we assume that V(n,| H) is 
m E equal to т, (and in view of (26) and (27) of §3 this approximation is not on 
of 4 and ~ too bad), then it would follow that the standard deviation of Jn, is of the order 
a E t herefore largely independent of r. Hence graphical and other smoothing methods 
t voa out without having constantly to hold in mind that | 2; — n, | can reasonably 
Dn. oe у жағ ы ыы is ы hs ian im жет те transforma- 
саш s son variable, 2, was sugges ес by б р ) Ч order to facilitate the 
ыы var {апсе. He showed also that the trans ormaion \/(® + 5) leads to an even more 
E variance, Anscombe (1948) proved that (7+) has the most nearly constant 
ance of any variable of the form . Ax 4-6). namely, 2, when the mean of v is large. He 


attri і 
а butes this result to A. H. L. Johnson.] 
(ii) Calculate (r+ 1) пдајт. 


(1) Smooth these values getting, say; 7" 
* to improve the smoothing of the ж?з. If this makes 


т) in Possibly use the values of 7% 
rious difference it will be necessary to check again that the chi-squared test and the Jr 


ule he 
€ have not been violated. 


е 2% Light can be shed on 
Та. thed two or three times, possibly by 
^ short, the estimation of the q,'s shou 


the reliability of the estimates of the 4,8, etc., if the data are 


different people. 
Cm ld be done in such a way as to be consistent with 
Ко үа of probability and also with any intuitive judgements that the users of the 
od are not prepared to abandon or to modify. (This recommendation applies to much 
Ore general theoretical scientific work, though there are rare occasions when it may be 
e.) 


Pre Е 
ferred to abandon the axioms of a scienc А 
T n objection could be raised to the methods of smoothing suggested in the present 
jum. It could be argued that all smoothing methods indirectly assume something about 
( Stribution p,, and that one might just as well apply the method of Greenwood & Yule 
) and its по вв Ноћ by Corbet её al. (1948) of assuming а distribution of Pearson’s 
b ES ІП, Ap* e-/», or of some other form. Our reply would be that smoothing can be done 
fines only local assumptions, for example, that the square root of (л, | Н), as a 
. tion of r, is approximately ‘parabolic’ for any nine consecutive values of r. Moreover, 
io iy Often be more convenient to apply the general methods of the present section than 
at : 
tempt to find an adequate hypothesis, H: 
ty of the results of 551 and 2. The condition for the applic- 
the methods should be satisfied with his 
to the values of 7 and m used in the 


quation (14) is exact. In particular, if 


E ji 72 
abil; Conditions for the applicabili 
арр ~ ч the results of §$1 and 2 is that the user of 

©ximations to бу (Вент | H) corresponding 


ap li 
Plication, This condition is clearly correct, since е 
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т is large enough the user would be quite happy to deduce (9) from (15) with r= 0. w 
he will be satisfied with the estimates of say 41,4; and 4; provided he is satisfied with 
smoothed values (71, Na, n5, n4) of Ny, ћу, Ng and па. 


В " CET. d 

5. The variance of n,. For the application of the chi-squared test described in $3 we n^ 

to know more about V (n,). We begin by obtaining an exact formula for V (n,| Н) = Wu ier 
and we then make approximations that justify the omission of the symbol H from t 


: s М Š ы ; ined as 
notation. It is convenient to introduce the random variable Tur = Ly that is define 


1 if the ‘wth species’ (of population frequency Ру) occurs precisely r times in a sample ври 
N — 
N (H being given), otherwise x, = 0. Clearly P (x, = 1| Н) = ) pi —p,)**. Now 
6(п |Н) = «(Жау 
[Л 
= У 4(а,а,) 
ЊУ 
и%у 
= 7 б(2,)+ У E(x,2,) 
n pv 
N! wy ae қ (20) 
АТО 95) > 2Р1 — py — p) N. 
This is exact. We now make some а; 


A у 15801. 
is ox pproximations of the sort used in deriving the Poise 
distribution from the binomial, We get, assuming 7?/N, rp, and rp, to be small, 


у e-N. 
(ДА 


CH ар Bay, 
а N! 
BS rir (N= эр) PAPA. — p,— р,) 2 аа, 


= 6(n,| Н) + 


Moreover, it 18 intuitively clear that term: 


0 
А кө п 
is int s for which p, or p, is far from r[N can mak 
serious contribution to the summation in 


(20). Hence, if 72 = o(N), 


“Ер 
Ф| H)&(n, | H) +S ауа, 
ГАД 


=6(п, | H)+{E(n, | H)\2— Mai. 
Therefore the variance of n, for samples of size 


N is 
Уа, | H)=8&y(n, | H) =F (Np,)" е-2Ури (21) 
2 (r!)? 
32) 
=й. "m 1 /2r е 
vl [н)- (77) биом |н). 
Formulae (21) and (22) are ele od 
t i : 
before they can Бе used for gant but need further transformation, when H is unk? oto? 


š calculation. Noti 
population frequencies are E n 


that | e ехре 
Ч, (и = 0,1,2,...) there аге n, species whos N) 


` “ence we have for r = 0, 1,2,...; 7? 7 
Vn, | Hy (n, | H) — У n. | Ма, е- Мала 
u=0 и ЕП | 


ч=о "| у! 


740|H)- Xa (Y "n 


T со 

E(n,|H)=— X тите“. (24 

N r! u=0 ) 
ow for any positive x, 27675 <7" 677, во 


мет 
"| 


Eln, | E)E Vin, | H)S Eln, | Н) ( е ) (25) 
Using Stirling’s formula for 7— 1 we have 

&(n, Н) V(n, |Н) &(n, | H) | ве) (r = 2,8, ...), (26) 

While Em |Н) Уба |Н) > 6( | Н) 1 —e7) (27) 

ње formula (65) in §7). Now the most desirable value for nj would be «а, | H) where 

rue, so if our smoothing of the n,'s is to be satisfactory for any particular values of 


r j i 
Small compared with yN we may write 


fi 


i 
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Similarly and rather more simply, when 7? = o(N), 


со ате 
fn 5 — 
те У Т р (28) 
u=0 r: 


ed as a test of consistency for the values of 


a Р Е 
nd these approximate equations may be us 
међу to solve equations (28) combined with 


A and и“, Indeed, it may be possible iterati 
~. and thus very systematically to obtain estimates of л; and 7% for values of r small 
"pared with yN. This iterative process may possibly lead to estimates of nj and 0*, but 
have not yet tried out the process. For most applications the less systematic methods 
Previously described will probably prove to be adequate, and any smoothing obtained by 
“Алдың methods can be partially tested by means of x? in the form (19), together with the 
equalities (26) and (27). (See also the remarks following equations (65) and (87).) 


8. Estimation of some population parameters, including entropy. Let us consider the 
Population parameters 
| 
Cnn = X Pi C- logo" (m,n = 0,1, 2, ...); in 
= 
terogeneity of the population. The sequence 


whi 
‘uch can be regarded as measures of he i 
e ‘moment constants’ of the population, while 


с 
QUT bes = 8, C2 o бә. шау be called th 
1118 called the ‘entropy’ in the modern 6 


o 5 
ze generally, c, „ is the moment about zer 


heory of communication (see Shannon, 1948). 
8 Е : 
election of an animal (or word), where ‘amount of infor 


о of the amount of information from each 
mation’ is here used in the sense of 
ability. (The last sentence of p. 75 


294 (19500 i i ithm of a prob 
‚р. 75), i.e. as minus the logarithm р 
BR ne ) We find it no more 


im. M. S. Bartlett has pointed out. 
‚ at any rate when n = 0 ог 1. 
an unbiased estimate of Cm, о is 


his reference is incorrect, as Prof. 
than of сул 


i 

Pes to give estimates of c,» 
is an immediate consequence of (10) that 

4 — ym, 

faa = ую 2” т). (30) 

4. | 

5,0 18 in effe 44) to measure the heterogeneity of samples of vocabulary, 

ct ule (1944) to m! ; 

4 seal pe istic’ of the material. The sequence of all 


_1/№) the ‘character! 
ж For example, as 


nd 
P. he Calls 10,0006, o 4 
i Е population parameters Cj. 


рої Ping moments of 6,0 involves all the 
nted out by Simpson (1949), for large Ñ, 
4 
Vlé o) N (со 68,0). (30 A) 
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> can take 
Unbiased statistics are rather unfashionable nowadays, partly because they — 
impossible values. For example, é,, 9 could vanish, although it is easy to see айс o d на] 
(Compare Good (19505, p. 103), where estimates of c,, о are implicit for general multinomia 


: аР ‚апі 
distributions, по attempt being made to smooth the лз.) We shall find estimates ofc, ,an 
also estimates of c,, о that are at least sometimes better than ĉn, o- 
We have 
1 31) 
Сто Е NM x педа, | Н), | 
r 


$ ; : я A % à ы the 
since this is in effect what is meant by saying that ¢,, o is an unbiased estimate of сл, v If € 
statistician is satisfied with his smoothing, i.e. if he assumes that n; (n, | H), and if he 
forced N' — N, then he can estimate [PEE 


49 
~ = —— У gms (82) 
Ст, о = Noo за 
" 
Р Ме" F : 4 Моге 
and he will be prepared to assume that this is a more efficient estimate than 44,0: 


Я of 
generally if the smoothing is satisfactory for r = 1,2, ...,t but not for all larger values 
т, then a good estimate of c,, о will be En,o(t), where 


TENTE ин 4 
‚0 ш--- ren, МУ s 
ш Nw т=т р т={+1 1 


We shall next consider estimates c,, ү of Ста We shall begin by proving that (exactly) 


E и ET 


Ei Tei 
mi = о) У" Enma : 


4 | (34) 
-21944(т,| Н)— &(log (1—q,) | H]j- 
The differential coefficient in this ex 


suitable 
definition of &(n, | H) for non-integra| 


pression is made meaningful by means of a jon 
пао 


l values of r. This definition is obtained from ed 
(10) by writing T(N + ЈГ (у ++ 1) PQY — r1) instead of je à 

In order t : i : i 
Ek | prove (34) we shall need the following generalization of (13), valid for 


E(n,| H) у) |н] = ч 


(35) 
s A» —2,)" f(p,). 


1 . 4 e ег’ 
Wealso require the following property of the gamma function. Ifbisa non-negative inte 


Ы 141. а 9% 
M+) = 1+5+...+5-7, 


where у = 0.577915... 


| & 
is the Euler- : rey? 
Jeffreys (1946, $15-04).) Te inier с constant, (See, for example, Jeff 


10) and (36) that 


&(n,| Н) = ( А 
dr = LP, (1—p ~“ - " E 
үз 1t + +log 75, 
= іп, н 1 1 
i ізет а (өрі |), 
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by (35). Therefore 


: 1 1 1 
&(n, | на + x — 5, 6(n, | H)—E(n,| Н)ФПов(1-<4) |5] 


N 
= Eln, | Н) &(—log q, | Н) = (9 Ур, (1-р) (—log p,); 
д 
by (35) again. Multiplying by 7”/N and summing with respect to т, we find that the 
right-hand side of (34) equals 


n М-т = лы 
> C —m ) / ý a =)“ ia три log p,) = Cm 
as asserted. 


Қ d? 
Су, can be evaluated in a similar manner by first writing down (9 Eln, | Н), but the 


Tesult is complicated and will be omitted. 
As in the estimation of c,, о, if the statistician is satis 
Write : 2 


T 1 1 1 
m1 7 от > нен өт vag *ЁР Wor dr 


fied with his smoothing, then he can 


с log nf — llog (1.0) | Н]. 


If N is large the approximation can be written 


1 1 1 Ж ПЕТА 
қаның ( 424 ue +17 тшшн. = = |Н\ |. 
N 


Now itis; r у r*—r ёъ о!" “ИЖҚ кей 
v it is intuitively clear that e(n- , which equals ў , must be Оту = o(N), an 


therefore Е і 4 

1 Ў i M lese , 

12у) > ита ов = ( + 3 Tec m у) a log ns} 
L пина, + лови (37) 

= &,,olog N — xao 2! "74974; 57, | > 
1 
Wh aper oy 38 
ere ее; Y (38) 
Hu қ " 
п particular, the entropy @ 12,1, where А 
1 d , 

я ж. А = У 39 
&,-lgN-xyX mi 21% ni) (39) 


The differentiation can be performed graphically for allr or by numerical differentiation for 
Табан (For numerical differentiation see, for example, Jeffreys & Jeffreys (1946, 


9:07) ) Another estimate of the entropy is бі,» where 


^ 1 d oen! 
ба = tog N-A 0m (а G8) , (40) 


e first occurrence of n, in (39). This estimate, 


stimate of the entropy. It can hardly be аз 


an unbiased е 3 
Perhaps the best method of using the present 


t) defined in the obvious way by 


Е | 
А Which the “ prime’ has been omitted from th 
6 + 
NS has leanings towards being | 
9d аз (39) when the smoothing is reliable. aps! 
богу for estimating бр, 18 00 180 the compromise Čm,1( 
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d , ? r to 
analogy with (33). For large values of r, the factor g, Я log n; may be replaced by log 


5 eee , of 
a good approximation. Terms of @,, ,(é) for which this approximation is made, i.e. terms 
the form rn,logr may be regarded as crude and unadjusted. 


7. Special hypotheses, Н. In this section we shall consider some special classes of ш 
theses, Н, which determine the distribution Pw So far we have taken this distribution a | 
discrete for the sake of logical simplicity. In the present section we shall find it convenien 
to assume that there is a density function, (p), where (р) dp is the number ot species whose 


ized 
population frequencies Не between p and p + dp. (The formulae may of course be generalize 
to arbitrary distributions by using the Stieltjes integral.) Clearly 


[sora ==, ui 
ja nf(p)dp = 1. = 
The expected value of p for an animal at random from the population is 
(|н) = Уй = eng e 
The appropriate modifications of the previous formulae are obvious. For example, instead 
of (10) and (20) we have 


N\ isi 
ÈL) = S| (7 P Porras = 0-0) f(g) араа 


N\ [1 45) 
(8) [рана овуда. i 


0 
(45) that ól | 
44) leads to the less preci 


y(n, | H) = 2 Е +0 (у) 


Notice the elegant checks of (44) and 


Н) =. m Н) = 
Й( | Н) = 0. Formula ( "o| Н) = s, Ва | Ну = 1, noD 


: „mu 
se but often more convenient form 


ЈГ (PNY ean F(p) dp 


si uos и 

SA Jo PAVE рд (а oN), 

while a similar treatment of formula 
We shall now list a number of ді 


(45) leads back mer 
them. The normalizing Constants а; 


fferent t Чу to formula (22): 


ypes of possible hypotheses and then 
uced from (42), 


dise 
re all ded: 
H, (Pearson's Type 1) 


Ир) = 7 


( 47) 


21-р) 
н, (Pearson's Туре Шш) “+ 1)! 81 PY (ax –1,6> –1). 
ас 49) 
(2) СЕЗ е-ёр ( ( 


a 
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Н, (same as H, but with а = — 1): 


Кр) = бре» (820) (49) 
Ay: Др) = sAp*exp(—fp—ep3) (0>0,е>0). (50) 
—le-AP (у> à 
Н; (truncated form of Н): Кр) -|P м E 2r (51) 
H, (truncated form of another special case of А): 
[2777 >р), (52) 
fip) = | (PoP) 
| 0 (p <р) 


Where E(w) = —Ki(—w) = "ai c" du. Ei (w) is known as the ‘exponential integral’ and 
) (=w) ( gr 
w 


has been tabulated several times. (For a list of these tables see Fletcher, Miller & Rosenhead 


(1946, §§13-2 ава 13-21).) 
We list also a few less completely formulated hypotheses, Н,, Н, and H, for which the 
Population is not explicitly specified, but only the values of 6м(п, | Н). Hence for these 


Ypotheses the parameters may depend on N. 


Th (Zipf laws): бит, |Н.) ост (r2 1,620), (53) 
Where ¢ is often taken as 2 by Zipf. (See also (94) below.) 
Н, (Н, with a convergence factor): 
yT 
E(n |n (ғ>1,6>0,0<т<1). (54) 
r = 
Н, (a modification of a special case of Hg): " 
Ла" 
Eln, | Ho) = зт) (rz 1). (55) 


We now discuss the nine hypotheses. 
Qui Н, has the advantage that the ex 
"I5. We can see from (41) and (43) that 


act formula (44) can be evaluated in elementary 


а=, (56) 
a42 _ +++) за 
&(p | Fh) = 24848 («+1)(®+8+3)8 


o be small when p is not small and (р | Н) to be large 
hesis of the form H, is to be appropriate at all, we shall 
d æ to be close to — 1, by (57). 


І 
к Most applications we want f (p) t 
по Pared with 1/5. Hence ifa hypot 
wally want В to be large, by (47), an 
У (44 
(44) we see that " "ERECTAE " 
&y(n, | 8) (9 (CESVVIICES ES ES 


Өлсе, by (2^), if the smoothed values nj and та were equal to their expectations, given 
P We would have " (w+r+1)(N=7) (59) 
jE = Там" 
(8) ң, ient approximation to H, if £20. Strictly, the 
с ў а аѕ а convent Me e 
Ypothesis 77 hacer since it allows values of p greater than 1, but it gives all such 
2 
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values of p combined a very small probability provided that £ is large. H, was used by Green 
wood & Yule (1920) and by Fisher (see Corbet et al. 1943). We have 


В 


= Е _ +2 04-421 (60) 
=n? é(p| Hp) = 


B atls 
so that æ must be close to — 1. Hence, if r? = o(N), 


АРАН Ват)! ( ғ) Оде (61) 


(c1) 7! VN + N, 
which is of the negative binomial form. 


Gii) Of all hypotheses of the form Н,, Fisher (Corbet et al. 1943) was mainly concerned 
with Ну, the case о = —1. (See example (i) in $8 below.) Then 


вас, =, (6%) 
бут, | Н, af (ra) = d (r? = o(N)), (63) 


say. For large samples, æ (which, unlike 2, depends on №) is close to 1 and the factor 27 maY 


be regarded as a convergence factor which prevents 5 бу(т, | Hy) from becoming infinite. 
т=1 


The convergence factor also increases the likelihood of being able to find a satisfactory 5 


to given frequencies, %,, merely because it involves a пе 
à new parameter. 
We see from (22) that = 


Vin, | Ha) (n, | Hy) | КЕ 56 (”) (Ny (64) 
ES Я 22r+1\ y +2) |` 
ТЕ fr = o(N) it follows that 
СА | Њу Zen 1 /2r (65) 
Eee. 


Thus in these circumstances Vy(n, | H) lies between the bou 
for each r about twice as close to the smaller bound tha; 
the chi-squared test, where x? is defined by equation ( 
assuming (65) to be applicable whatever the distributio: 
course, we may often be able to i 


Am mprove on (65 
tribution of p. For convenience in applying bs whe 


ing 
nds given by (26) and (27) Не. 
n to the larger one. When арр 

19), we can hardly go far ek of 
n determined by H may be- и. 
п Н is specified in terms 0 
we give a short table of values 9 


вы КЕ gu ет (66) 


Ей 
i»? 


T k, 
5 А A : н 
т 
1 1:33 
2 1:23 б 1-13 
ТЕАТ te | а | ш 
4 1:16 9 111 5 1 == 
У 1м 10 k 14 i 
TR 15 1:08 
а tes ofr, the approximation 1 + 
( 


2V(ar))} is correct to two places of de? 
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Suppose we are given a sample of size № and we wish to estimate £ and т. The method 
used by Fisher was to equate the observed values of Ern, = N and En, = 8 to their expected 
values, (Note that S is the observed number of species and should not be confused with 8.) 


This led him to the equations 
flog,(1—2) = 8, N = fx/(1—z), (67) 


N x 
S —(1—2) log, (1—2)’ (68) 


which he solved by using a table of z/(1—2) in terms of logy) (W/S). 

A theoretically more satisfactory method of estimating 2 and = would be by minimizing 
№, defined by (19), with r = оо. This method leads to equations which would be most 
laborious to solve by hand but which will be given here since large-scale computers now 
exist. To prevent misunderstanding we mention at once that Fisher obtained a perfectly 
800d fit by the simpler method, in his example, i.e. example (i) of §8 below, though, as 
Pointed out in $8, Н, must not be too literally regarded as true. 


By (65) we may wri Р 
ау write ыы в pu 
, w= М зала). (69) 
The equations giving 7 and = will then be 
Руби |r = Уйла”, (70) 
ВХЛ, а" = Erk n,e”, (71) 


and these ; ved iteratively. 
equations could be solved itera У T . 
When 2 and а are specified the cumulative sums of &y(n, | Нз) can be found by making use 


© approximation А ar 1 
eap rloga) +g (1 + Нот gy А (72) 
"1 


1 
g $105 2 and © are negligible. 


Which will be a very good approximation if the terms involvin 
This approximation can be obtained by means of the Euler-Maclaurin summation formula. 
xima 8 


35, 57:21). 
8, for example, Whittaker & Watson (1935, $7-21).) 
= — lin H, weobtains =% and of course Фу (ту | Н) -оо. 


iii) and (iv) of $8 that we may wish to take 
example, На = — 2 we would obtain, 


(iv) We havejustseenthat when Е 

Ste are strong indications in examples (ii). ( E 
- 1, and then even worse divergencies occur. 

(61), the intolerable result 

«(т | HJéx (ts | Hp) = со. 
encies we could in theory use hypothesis H4, with a small value 
nfortunately, this hypothesis seems to be a. свиње “~ it is mentioned 
for its САП as intermediate between Pearson's Types ІП an pw 
vergencies is to use truncated distributions. These 
tically pleasing but at least some of them can be 
rm of H,. We may describe p, as the smallest 
In most applications it would be difficult to 
with any accuracy. In fact if the estimate of 


In 
of E to avoid these diverg 
Partly 


у) Another method of avoiding di 


rated distributions are not theore 
Pog 2 analytically. H; is 8 truncated fo 

ш Population frequency of any spectes. 
i Sample large enough to determine Po 
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: ll 
ро were to be reliable the sample would need to be so large that n, would vanish for all E». 
values of т. In the examples of $8, 7, is always larger than any other value of 7,, a 
samples would need to be increased greatly before one could expect even т; to vanish. 
i 73 
We obtain from (41) s = BE(pf). (13) 


ww (74) 

Now пак а ra rr en 
' ET 
an equation which is undoubtedly well known. It can be proved, for example, by 18118 


2). 
Dirichlet’s formula for у. (See, for example, Whittaker & Watson (1935, § 12-3, example 2) ) 
In particular, if w is very small, 


E(w) ~ — log, (y'u), » @ 


6 
where y' = e = 1-781072, ш? 


(Cf. Jahnke & Emde ( 


be 
1933, р. 79), where our y’ is denoted by у.) Since Ро is assumed to 
small, we have 9 


Е Жа” 77) 
s= —fllog(poy'f), posft-te-r-st, ( 
On applying equation (46) we see that 


” 
ИТЕК, " 
бул, I)e (ғ) (r2 1,72 = o(N)), 
79) 
(по | Hs) = PEL ром + Де = Blog [bay (N + f)]. | 
The check may be noticed that equations (77), (78) and (79) are consistent with 
* 
s = @(в| Н) = X ёб, | Н,). 
= 
. H І 
Formula (77) is of some interest, but in most applications both Po and s will be əй : 
metaphysical, i.e. observable only within very wide proportional limits r 
" DX ae of determining Ро Would not apply to the same extent Ра = — 2, ue p Р 
ypothesis Не. (This hypothesis is fa; +" и m 
Pu 6 Ypothesis is fairly appropriate for example (iv) of $8.) W° % 
бу(т | Hy) = АЕ [78 + М)ј~ — Хр log Poy'N 
БО 
: 1) 
Ex(n, | Hy) = Аа в 
х(п, | Н.) A=) ("> 2,72 = o(N)), 
where and A, unlik 
unlike f and Ро, depend on N and are given by 
МЕГА " 
N+ B 
and де ЖЫЙ „_ N+, e 
(РВ) ^ аур) бр=ехр [y-a 


If À and z can be estimate y 
and s can then be determ 


d from a « 
sample, then Po can be determined by (82) an 


ined from (41) 
8+8 = 2200] 


Band 
; Which gives | 
(в 7 
РЁ (ро) Зе. palog, (p,'8)]. 
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In order to estimate A and x from a sample, one could minimize у“, more or less as described 
above for Ну. For this purpose and for others it may be noted that, by (22), 


r—1 1 (27 
повео, Hoe sa (у) >з), (85) 
À 
Рут, | Ној бу(т | Нур (86) 
өзі 
Е. а к 
= if »=1. (87) 


By comparing (85) with (65) we can get an idea of the smallness of the error arising when 


calculating y? if (65) is used for hypotheses other than H3. 
Another method of estimating A and v, rather less efficient, but easier, is the one analogous 
to that used by Fisher for Аз, namely, we may assume that the expected values of № – n, 


and of S — n, are equal to their observed values, i.e. 


m jj = Ae a caos, (122)] = A(1—e7Y Уер), (88) 


|| 
© 
ч 


У S = Азор, (1—2) = AY(1—e-*) = АУ, (89) 


(S—n,)/(N — т) = Y3-(eF =), (90) 


= lim Y,, where Y, = 0 and, 


Where x = ре, We may solve (90) iteratively, for Уе: 2 m 


fora — 
кы; ы = 5 (ora 1)4. (91) 


When А and ж are specified, the cumulative sums of Ey(n, | Н) can be found by making 


Use of the approximation 
ior „i * y z 
Y gegel- lo - FC loget) + gri nl 


1 
14 Dogs. (92) 


: 1 Р 
Which will be a very good approximation if the terms involving $ log апа =. are negligible 
(of, equation (72)). If (1—2)7 is small while r is large, then we can prove the following 


PProximation: 
tər " ІНЕ —ajrl—y-1o * 1-2)-log,r]. (93) 
> вт | ВА, (1 x) [1 — y —log.( 
I 
Па is small but (1—2)7 is large, then 
= (93А) 


{>т ; 
>; Ex(my | Но = (1-2) 


93) апа (93 А) it is best to use (92), the calculation of 


When ; 
n of 
i peni that the error integrals may be needed to several 


ich į қ sis 

я is, however, ill-conditioned, 50 
Ima] places, Е 
Biometrika 46 
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is di b 
(vii) We now come to the ‘less completely formulated’ hypotheses. H, is discussed by 
Тірі, especially with 6 = 2 and also in the slightly modified form 
> 4 
£(n, |Н) ос). 


: > toup, 
(See Zipf (1949, pp. 546-7), where there are further references, including onesto J. B. ye e 
M. Joos, G. Dewey and E. V. Condon.) Yule (1944, р. 55) refers to Zipf (1932) and ob] 


Е its аге uD- 
to Zipf’s word distributions on two grounds. First Yule asserts that the fits аг 
satisfactory, and secondly he points out that (in our notation) 


N=6y(2rn,|H,)=0 if 1< £3; 


1 ; (Er(r — 1) т, 

А == 2 = = T 
while cso |7 p*/(p)dp = ép | H) “(тте 
(viii) Yule’s second objection to H, can be 

factor’, 27, giving H,. If H, is any good at all fo 

fairly close to 1. It would be of interest to spe 
by solving the simultaneous integral equations 


" ence 
overcome by introducing a ipt. ђе 
т any particular application then di fl) 
cify H, in terms of a density function, 


Ax 


т 1 Тә қ (95) 
тај, Gore foy (ia, 8) 
If £ — 1, then H, reduces of course to А, 


: ; iy of $8 
У because it Works so well in examples (ii) and (61) 05% 
Besides its formal similarity to Н, 


with ¢ = 2, gr 
(81). А disadvantage of not specifying f(p) is that Vy(n, | Н, 
out from (22), though it can а] 
Moreover, a correct specification 


values of the n,'s and is more likely to lead to a bette 
population. 


uatio 


се 
е wor! 
) cannot be conveniently 


о 
т understanding of the structure 


discussed for Н, and Ну, е 
best to guess a formula for Vy(n, | В, 


Пр 
after ex i s а е sha.: d 
discuss this method further in this secti es inenting with formula (29). Ы; i s 
« from the equations 


© 96) 
МЕ А ( 
Жент = +18, (1 — у], 


А 97) 
Желе А ( 
rrr) = zle+(1 ~#)log, (1—zx)], 
у = Etsy (у. (98) 
Nee 
© log, (1 = 2) f 
t can be determined either by tabul 26 
5 t; i ^ ал200. 
апа determining У from the шн. ing the right-hand side of (98) or by writing 2 = 
99) 
Tx = (% 
а" +8/N)~1 
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У сап be founditeratively by writing У = lim У,, where У, = 1+ N/S,and, forn = 1,2,3,..., 
Yoga = (1—е“ Ка) (1SN). (100) 
„АТ 
= (101) 


Then, by (96), we can find A from A= you 


Having determined А and x we may wish to test how well Нұ agrees with the sample. For 
this purpose we need to calculate cumulative sums of the expectations of the n,'s. This can 
be done by means of the approximation 
t>r JL 
X 
tO 1) 
deducible from (92). If (1—2)7 is small while 7 is large, then we have the following 


Approximation: 


1 ar 1 
= 0(— 108,2) - El- ("+ 1) 106,2] ++) т) ( + Hog, 5) , (102) 


i> 

У | Hya- (1—x)[l1-y-—log, (1—2) —log.r)}, (108) 
t 

deducible from and of precisely the same form as (93). An idea of the closeness of this 

APproximation can be obtained from example (ii) below. If 1—2 is small but (1—2)7 is 


arge, then 5 
de (103A) 


t>r 
У (| Н)= — RR 
у (| Ho (1—2) 


When in doubt about the accuracy of (103) and (103A) it is best to use equation (102). (See 


16 remarks following equation (93 А).) 
8. Examples. In each of the four examples given below we use at least two different 
Methods of smoothing the data. One of these methods is, in each example, the graphical 
thing t 


“moothing of Jn, for the smaller values оЁ and another method is the fitting of one or other 
of the nine айыда! hypotheses of $7. The discussion of these examples is by no means 
i 

ntended to be complete. 


Example (i). Captures of Macrolepidoptera in а light-trap at Rothamsted. (Summarized 


fro ed m = 240 
™ Williams’s data (Corbet et al. 1943).) N = 15,609, 8 
| 
" n. nY r Ny „ tis (summed)t 
LÀ T 

7 3-5 21-30 18 15:5 

1 35 40 2 2 2.9 31-50 16 18-0 

2 11 20-0 s 5 3-0 51-70 17 11-4 

2 15 13-2 T 2 2-8 71-100 8 11-2 

= 14 9-9 4 2-6 101-150 9 11:8 

В 10 7-9 15 3 2.4 151-200 7 7-4 

11 6-6 16 2 2.3 201-500 12 16-1 

и 5 56 H 3 2-1 501-1000 6 4-6 

5 6 48 І 3 3 20 1001-оо 1 0-9 

10 1 a5 20 4 1:9 2349 1 = 

| 
f Ing tables this wo? d ‘summed’ will be taken for granted and omitted. 
n future tal » 
We now pr t the results of the calculations, followed by comments. (The columns 
esen ne 


Jained in these comments.) 
| 17-2 


ће 
“ded niv in the table above are ех 
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а т” nt п” п" ni* Жж жж peek | ужи 
1 35 35 35 35 40 ri 1-4 1-8 1 

2 11 19-4 24-0 22-5 20-0 2-1 23 2-2 2 

3 15 13-7 18:1 16-3 13-3 3-0 2-9 3:0 3 

4 14 10-2 18-1 12.3 10-0 3:8 38 3-9 4 

5 10 7:8 10-2 9-7 T9 48 48 48 5 

6 11 6:3 8-1 7-7 6-6 59 5:9 5.5 6 

7 5 5:3 6:8 6-0 5:6 | 


The function n; was obtained by plotting Jn, against r for 1 <r < 20 and smoothing for 
I <r<7 by eye, holding in mind the method of least squares. (See note (i) of $3.) n, Жа 
obtained in the same way, but an attempt was made to keep away from the graph of n; 
(except atr = 1) in order to find out how different a smoothing was reasonable. Next пу WaS 


T . 
obtained by smoothing the cumulative sums У т. Finally, ту is the function obtained 
t=1 


by Fisher, i.e. using our hypothesis M, (equation (63)) with f = 40-2 and = 0-9974. A more 


i iy de ӛтеп © iv were 
complete tabulation of 27 is given in the first table. The ‘summed’ values of пр W° 


caleulated by means of equation (72). No statistical test is necessary to see that the fit of 


ту is very good. The values of ;* corresponding to the four smoothings of the data are 
denoted by 7“, 799, үзе and pitik respectively. (Logically this gives r* two different 
meanings.) (r**** = 0-9974r, by (2') and (63).) In accordance with § 3 we could force the 
тн, ete., to be smooth. This has not been tried here. What is clear is that if M, is not accepte 
then most of the values of r*, etc., are unreliable to within about 0-2 ог 0:3 The approximate 
values of y? given by (19) with r = 7 and assuming (65) are 10-9 es 9-4 and 17 
respectively. The number of degrees of freedom is He acia аьан 6 а т. 16 seems 
safe to take it as 5 for nl”, 6 for m - None of the values of e is particularl 


ap „ала эу and 7 for niv 
significant, though all are a bit large. The data can be blamed for the largeness of the value? 
R a Б | 28 

anit ought to be. Of the four smoothings Fishe? | 


of x2, since n, is obviously much smaller th 
; . ане 
best approximations to the ‘true expectation 


| 4 
se оп the evidence of the sample, but Fisher’s зоо? 
imple. 


;ng 
am hide’ 
ample doesnot depend much on thesmoot је 


5? п not represented by the species in the — 
say from 5 to 8 e iria (65). Perhaps this standard error 819 

arent + ‚ to alow x = 
Е ; if it is applicable ( F d о given to n". 
written —logiopo = 1:18 + 0- orm, H, 


Formula (77) 


to sample sizes large en, 
be of the order of 10/p,. 
H, and Н, (with the ass 
т, on other independent 
of smoothing the data. 
the following considerations, ким statement 


tribution defined by H, would stan” (о 
sand ру. У would P® pat 
ага; nade probable by the actual sample га 8 of 

meters) would give good fits to the wale od? 
запа И, provide good me 
can be more fully justifie 
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ле. were reliable then it ghoid be possible to use it to estimate the simpler measures of 
Pom. үе such ав с. ọ Now we can see by (30) that é = 0-03935 and 6, „==0-0085. 
Lr 1e caleulations, the complete data given by Williams must be used.) Hence, by (30A), 

is reasonable to write cs ọ = 0-03935 + 0-0007. Let us then see what value for c, 15 implied 


by H;. We have 
Er(r— 1) ту = fX(r—1)a* = fa? /(1—2x)? = 0-0243. 


[As а check, L pf)dp = В | peP dp = "IN qe-tdq- f = 0-025.] 
Po Pop 


Cle. ; 
red then Æ, cannot be used to estimate c; о. It would be true if misleading to say that 
518 decisively disproved by the data. Similar remarks would apply in the examples below. 


пр те (ii). Eldridge's statistics for fully inflected words in American newspaper English. 
"dridge's statistics (1911) are summarized by Zipf (1949, pp. 64and 25). We giveasummary 
ii) below; more fully in the second table. № = 43,989, 5 = 6,001. 
In this example the values of n, for 7 < 10 are much larger than in example (i), so we have 


far s : 
more confidence in the smoothing that is independent of particular hypotheses. We shall 
ations in columns and then make comments on each 


(7), (8) and (9), that the proportion 


of Zj ie ч 
ipf Ssummary in column ( 


T some of the numerical calcul 1 
of th їп. We may assert at once, however, by equations ] p 
я len population represented by the sample is close to 1—n,/N = 14/15. Ifa foreigner were 
айы ~ n all 6001 words which occurred in the sample he would afterwards meet a new word 
about 6-7 9/ of words read. If he learnt only S— = 3025 words he would meet a new 
-roots rather than for 


Wor i: 
fi About 11-6 % of the time. The corresponding results for word. 
Y inflected words would be of more interest to a linguist. 


| —F 
M (ii) (iii) (iv) (v) (vi) (vii) (viii) (ix) (x) (xi) 
ү f Vn, b, say -А rb, rb; say 6; [4 т r* 
prd 
| 
1 sor | gas | ses |218 | 565 | 566 | 525 2976 | 2961 | 073 
2 | 1079 32-7 3257 |100 65-4 65-4 327 | 1079 | 1075 14 
2 516 297 99-7 5-7 68-1 67-8 22:6 511 509 24 
1 294 17-1 17-0 2.8 68-0 70:2 17:5 206 305 3-4 
3 212 14-6 14.2 1:8 71:0 72-6 14-5 210 209 4-4 
5 151 12:3 12-4 1-5 74-4 74-7 124 154 153 54 
ТІЗЕІІ| 109 | 12 | 763 | zee | "29 119 | ns | 62 
2 84 9-2 9-6 12 76-8 76-8 9-6 92 91 = 
1 ml пе | ба | an |же) | міл 70: | = 
| 0 45 6-7 7-3 


е values of 7 only as far as r = 10. For larger values of 


k-point smoothing formulae with 52r. 
ard error of about 2, so one place of decimals is 


T a and (ii), We first consider thi 
"à Smoothing could be done by using 

a Each entry in this column has stand 
Opri 

| = by eye. Experiments 

ing results. For the 


(1944, §146). For 


m This column was obtained by smoothing a graph of column (iii) 
| rmula did not give quite as convine 


е five-point smoothing fo 
the Poing smoothing formula, see, for example, Whittaker & Robinson 
а. application it would be Jn; = Jn, — Аво) (r = 2,4, біле), 
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(v) This column of differences is given as a verification of the smoothness of column о 

In fact minor adjustments were made in column (iv) in order to improve the smoothness 
У). 

Е numbers b, of column (iv) are roughly proportional to 771. This fact suggests и 
rb, should be formed and smoothed again in order to improve the smoothing of {ы a 
further. This process is of course distinct from assuming that rb/ should be constant, whe 
the function б, is a smoothing of the function b,. Pere 

(vii) and (viii) These columns have already been partly explained. The purpose © е 
improvement in the smoothing is more for the sake of the ratios жулт. than of the nr 
themselves. ( - 

(ix) Where the smoothing of yn, had no noticeable effect we have taken 0/2 sales л 
clearly typical that 642 = n,, since the eye-smoothing is unlikely to affect n, convincing у. 


i to 
Therefore if the smoothing is tested by means of a chi-squared test it will be reasonable 
subtract about two degrees of freedom. 


: d 9, ssume 
(x) We have scaled up column (ix) so as to force X rn; = X rn,. We can then ав" 
т=1 т=1 
9 
N' = N, convenient for applications of $6. Note that 2; b, 
т=1 
given by (19) and accepting ( 
degrees of freedom. Thus ou 
factory smoothings. 


(xi) r* is obtained from formula ( 


(n, —n,*[nz = 6-5, 80 that Ж? 


В ae КЕТЕТІНІ. 
65) as a good enough approximation, is not significant on ae 
т smoothing is satisfactory, though there may be other 8 


, ' „ог of 1 
2’). The larger is ғ the larger is the standard error е ИЙ. 
We may get some idea of the error by means of an alternative smoothing. The standard © 


қ ау 
of 1* can be very roughly calculated by an ad hoc argument, inapplicable to say 5". we - 
reasonably say that the variance of 2n5[n, with respect to all eye-smoothings will be abo 


: . 3 1 
the same as that obtained by regarding n; and n! as independent random variables У 
variances circumscribed by the inequa 


ae 6 
è : lities (26) and (27), or nearly enough, defined bY i 

Now if w and z are independent random variables with expectations W and 2, we p 

4%) E m. | 

E Xkr 


and hence, to a crude approximation, 


7(%) «М, WV (а) 


2 РА э 
iie Уш) _ V(w) Via) (109 
WB we + ўз. 
1% follows that Vis) = V(2nj]m;) _ 1 li (109. 
1%2 


(дайы) "а oa 
во that V(1*) = 0-732 x 0-0010 = 0-00052 and 
(xii) (see the second table) 


given byn, = 8/(r2 +r) 


< 
r= 
s remarkably good for 14 
O te 
у 5 ту 
8 discrepancy, since 2, 
ег 


+ fac 
ence to the sampl. " atisf? 
i ple that n” cannot be 8 
for sufficiently large values of ғ, since Жуу” — r 
A 


00 instead of being equal to М. 
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H РР | 
(i) (ii) (x) (xii) (xiii) (xi) (xiv) 
п, ny n 5 TÉ тж 
1 2976 2961 3000 | 3008 
297 | 08 0-73 0:67 
2 1079 1075 1000 1002 1 4 1.5. 
3 516 509 500 500 2-4 24 
4 294 305 300 301 3-4 3-3 
5 212 209 200 201 4-4 4-3 
6 151 153 143 144 5:4 5-2 
7 105 118 107 108 6-2 6-2 
8 84 91 | 83 84 — 7:9 
9 86 70 67 67 = 8-2 
10 45 -- 55 55 — -- 
11-15 156 -- 170 170 -- -- 
16-20 76 -- 89 89 -- - 
21-30 78 — 92 92 — — 
31-40 34 — 47 47 е - 
41-50 28 -- 29 98 = — 
51-60 10 -- 19 19 == ==, 
61-со 71 es 98 90 = = 
4290 1 = = = = = 


(xiii) The fit can be improved by writing ny = Aa"/(r?+7) as in equation (55), i.e. using 
YPothesis Hy. We find by equations (100) and (101) that A = 6017-4 and x = 0-999667. 


olumn (xiii) can then be easily calculated directly for r < 10 and by use of (102) or (103) 


Or; 2 : ee 
a 77 10. ((103) gives the correct values for и" and nip, to the nearest integer, and it gives 


n 5 { © 
Ew 7 89-96, as compared with 89-90 when (102) is used.) Note that DR 
"= 


but is still significantly too large. A better fit could be obtained 
ergence factor other than 27, 


nz = 365, which 


im 1 : 
Plies an improvement on т” 


Y the method of minimum x? or by using some simple conv 


Wb. ав ета) with а> 0, b» 0. " : 
(xiv) r** is defined as (r +1)”, ніп, and is equal to r(r + 1)/(r + 2). This column may be 
good. It is by no means clear which 


Co: Е 
Mpared with column (xi). The agreement looks fairly 
imates of the ‘true’ values of r* forr <7. Column 


82 better fit to Eldridge's data for r « 9 (and could be extended to be a better fit for all 7) 

5 an is column (xiii) but is not as smooth. Columns (xii) and (xiii) would be preferable if 

Ое theoretical explanation of the analytic forms could be provided. Such an explanation 

might also show why the fit is not good for large r, even with the convergence factor x. The 
tation оп у in equation (46) may be relevant. 

Ay is true, the population parameter C2, 9, given by (31); 

А 57-1 А [5-22]. . (106) 


mam Ne 1== 


(x) i 


can be expressed in the form 


Pormula (106) would give ¢2,0 = 0-00928, but this value is probably a bad over-estimate 
2. 

SÌ 9 A = 

"10е де is too large for large 7 and the terms of ya? Tr^ ші for large r make most of the 


epends mainly on the larger values of r repre- 
f Eldridge's data is not complete enough to 
could be estimated from equation 


Contri 


Sent, bution, Similarly, 4, о, біуеп by (30), d 


Calo °Ч in the sample, but Zipf's summary 0 
“late 6, о. Similarly, assuming Fh, the entropy, б» 
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(39), and this method could be expected to give close agreement with the correct ма 
since c, 1 does not depend so very much on the more frequent species. But I have no 

obtained a closed formula, resembling (106) for example, and the arithmetic required if no 
closed formula is available would be heavy. The estimation of measures of heterogeneity will 
be discussed again under example (iii). 


Example (iii). Sample of nouns in М. acaulay’s essay on Bacon. (Taken from Yule (1944) 
Table 4-4, р. 63.) № = 8045, 5 = 2048. 


r т, т т, ? т, T | ® r Мр 
1 990 11 24 21 1 31 2 41 1 
2 367 12 19 22 4 32 1 45 E 
3 173 13 10 23 1 33 1 48 1 
4 112 14 10 24 2 34 1 51 1 
5 72 15 13 25 1 35 1 58 ] 
6 47 16 3 26 5 36 1 65 1 
1 41 17 10 27 3 37 1 76 1 
8 31 18 7 28 4 38 2 81 l 
9 34 19 6 29 1 39 4 89 | 

10 17 20 5 30 3 40 1 255 1 


As in example (ii) we can state some 
If our foreigner learns all 2048 nouns t] 
all but (12-3 + 0-5) % of the population, 


по 
conclusions at once, without doing the am ғ. 
nat occur in the sample his vocabulary will gee 
assuming formulae (9) and (65) or (87). If he ©" 


/ ы н Е the 
only 1058 nouns his vocabulary will still represent all but (n, +2n,)/N = 19:3 me 
population. ii) 

. u g 
We now present three different smoothings corresponding precisely to those ofexamp lel 
, » 10* 
т ш ” n n? сы т** „и |4 logon, $ logor ae 
| dr dr ЕС 
| | 4 
1 990 | 990 | 1024 в | 9% 
i "EE ME | 1000 0741 067) oss| -oso | —0-65 0401 
| з 173 | 173 | 170 iva. | So 16 15 | -o30 | —097 | обе 
" 12 | 112 | 102 103 | 30 | 24 | 24 | Сор | —0-26 0-655 
5 72 | 76 55 5 | 53 33 33 | -017 =020 | 01% 
6 47 | 56 | 49 4 | $3 | £3 | 43 | Iois | —016 | 088 
7 41 42 35-5 35 $3 5-2 5-1 —0-12 — 0-14 0876 
| 8 Bl | за | See s | 25 6-2 61 | -0и | -012 | 09% 
9 34 27 99.7 29 7.3 7.2 74 —0-10 — 0-11 091] 
10 17 22 18-4 18 8-2 8-2 8-1 —0-09 — 0:10 10° 
11 24 | 185| 155 je 9-2 9-1 — 0-08 — 0:09 = 
12 19 | 160| 121 19 | — | 102 | 101 = = < 
13 10 | 13-7] 113 10 | —7 | ERE | jg = = 
14 10 | 109 б B 22 12-1 12.0 M == ^ 
15 43 96| 85 в | — | B1 | 130 = 2% = 
16-20 | 31 325) 30.5 == 14-1 А — — ue 
21-30 27 i) 14-0 ыы 
et == 31-5 E E 
SERLO 18] 4 сео ЛА ~ b == Е = = = 
SI490] 8 | | о | 13 | = | Z Z = = >” 
101— о I4 = | | 39] = = = 5 A 
255 Ко Ай 36| — = — — jo E. 
Р © | 254 |252 = == 
т, Was obtained ђ smooth; 
y othing An, graphically, 
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^, = S[(r? +). It is curious that this should again give such a good fit for values of » that 
are not too large (7 < 30). The sample is of nouns only and, moreover, Yule took different 
inflexions of the same word as the same. 

nj = Аз | (73 +r), where A = 2138-90, ж = 0-991074, the values being obtained from (100) 
and (101) as in example (ii). 


15 


The expressions У (ni—n,)?/ni, etc., take the values 9-5, 21-2 and 27-3. The values of 


r= 
X? would be about 2 or 3 larger. (See (19), (26), (27), (65).) There is no question of accepting 
ny for r > 50 but it is better than n7 for т < 15. When < 9 the values of 7* and r** (and there- 
fore of 7***) show good agreement except forr = 1 and r = 7. If the analytic smoothings had 
not been found, the value of 6* would have been smoothed off, with repercussions on the 
funetion ni. The discrepancy in 1* must be attributed either to a fault in the value of 
ni (and therefore in Hy) or must be blamed оп 7, (i.e. on sample variation). If I had not 
Noticed the anal ytic smoothings I would have asserted that 1* = 0-74 with a standard error 
of Something like 0-04. (See equation (105).) 

We now consider two of the measures of heterogeneity in the population, namely, сз у and 
б Ву (30) we can see that 6, = 0-00272, agreeing with Yule (1944, p. 57). Also 
3,0 = 0-00003957, so that by (30А) we may reasonably write с; = 0:00272 + 0-00013. 
Assuming Н, to be valid for r < 30, we may also estimate c» у by & o (30) as in equation (33). 


€ have, in a self-explanatory notation, 


1 |.39;-1 са 
: = —— at + Y rn, 107 
& (30 | Њ) = xs i + oF n). (107) 


M . 
Now, as in (72), 


7 „31 о a3? WT 
у Е 2 {B(— 321og,2) + (dob) — 82-924. 
хх 


30 p— 
But, as in (106), $; — 1 a" = 99-501, so that X roie = 16-577. It follows from (107) that 
à ДЕ Ur | 
“2,0(30 | Hy) = 0-00246. This is about two standard errors below its expected value, based on 
© simple unbiased statistic ё, The discrepancy may ават be attributed to the large 
Value of п”. If, instead of nj”, the smoothing п" is accepted for 7< 30, we would get 
5, (20) = 0-00267. (It was in order to obtain this comparison that we calculated ё, (30 | Hy) 
rather than 6. (50 | Hy). The fit of п" deteriorates at about r = 30.) 
The last hires columns of the table are related to the estimation of the entropy, c; ү. (See 
СТЕ a ined hically for r — 1 
Pquation (40) and the remarks following it.) 7 logy)”; was obtained graphically ) 
* ang 3 Бу numerical differentiation for 7 = 3, 4,...,10. (The graphical and numerical 
d m 
— ) of course calculated 
Values agreed to two decimal places forr = 3.) The column T logy)”, was 


ж log, ga ( ! + п Ш e. The crude estimate of the ‘entropy to base 10’ or ‘entropy 
WI Mer ETC 10 6 
r 7+1 u К 
1 => i B 
*Xpresseq in decimal digits’ is 10810 N- Xmlo&u! = 2-968 decimal digits т, 18 
Accepted for r= 1,2,3, + 10 we find that 
d орот) + X r} = 3-051 decimal digits. 
(10) = log V -5A 5 vd loge Logon) TAS ов) 5 


r=1 
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We shall next calculate ё, ,(50 | Н,), using another self-explanatory notation. Since, by 
Jeffreys & Jeffreys (1946, §15-05), . 
1 1 
од тарт ШЕ. 
it can be seen that 4 


1 10 H d m 
&, 1(50 | Hy) log, N — N | > Thy (o. logige + d logion, ) 
у 


50 ө 50 3log,, c 59 © 
10 . r 
+ тт 106107 +10602 У; rn” — =a En, + У rn, logy? 
п m п 51 


= 3:192 decimal digits, 


as we may see by means of rather heavy calculations, using the last column of the b 
together with equations (72), (74) and (92). The crude estimate of c, is the smallest of Ж 
three. This is not surprising, since the crude estimate is always too small in the special са: 
of sampling from a population of s species all of which are equally probable. 


т 
Example (iv). Chess openings in games published in the British Chess М agazine, 1951, т у 
the purposes of this example we arbitrarily regard the openings of two games as equiva? 


"дег 
only if the first віх moves (three white and three black) are the same and in the same orde 
in both games. N = 385, S = 174, 


т, т; ny n? учи 
1 126 126 126 126 0:39 
2 22 22 24-6 24 s 
3 5 7-6 ^81 8 2-0 
“4 4 48 40 4 3-0 
5 ш 2 3-2 2-4 94 4-0 
6 4 2-6 16 1:6 5:0 
7 0 2-2 141 1:14 6-0 
зы — 0:85 0-86 TO 
15 1 ке 0-66 0-67 8-0 
m = 0-52 0-53 9-0 
11 0 
18 1 
14 1 
16 
n 1 3-97 4-80 
36 1 
© = 


Anz was obtained by graphical smoothin: 
_ ћу was obtained by assu 


tbe 
қ " ere 
parameters 2 and А were ob x 16. n, = Фау, | = № e ^id 
and п, for 7 > 2 is then given by (81). Next рана = ВИТА A 846 
using equation (80). Then (82) Pen Po Was determined ав 0-00011304 = 1/8 (74) 


p= 2-040, во that, in accordance with (52) a” 


(p> 1/8846), 
(P < 1/8846). 


8 of Jn, , 
ming H, (see equation (52) 
tained from (91) and (89) 


0-19852 22-04 
fane p“ e 0p 
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Fi : T€ К 
ы equation (84) gives s = 1132. This then is the estimate of the total number of 
penings in the population, though the sample is too small to put any reliance in it. 


n; (r> 2) is simply (8 %)02-т) = 48[(r* —7). 
p just ва good a fit as т. It gives an infinite value to c, 9, but this is not as serious an 
sa ion as it sounds since М, would also give quite the wrong value for сь о. (Cf. the 
neluding remarks in the discussion of example (i).) 
Бы in the table the values of 7* corresponding to лу, calling the values 7** in conformity 
or a convention of the present section. Clearly 7** = (r— 1) 2 when r>2. Thus the 
ра ie орон frequency of the 126 openings that each occurred once only in the sample 
39/385 = 0-001. 
Eros who learnt all 174 openings would expect to recognize about 67 % of future 
480 TBS for the same population, assuming that the sample was random, If he learnt the 
БЕ ene that each occurred twice or more in the sample the percentage would drop to 
4 and if he learnt the 26 that occurred three times or more the percentage would drop 


% 
949%. (See formula (6').) 


9. Index of notations having a fixed meaning. 
1. N, п, (but see also $2), no, q,,7* (ава definition of the asterisk, but there is a slight 


change of convention in $8), n; (here again there is a slight change in $8), &( ), V( ). 
$2. s, py, Н(рь Po D) = Н, бу, Ш. 
$3. №. 
$5. tur = Бр бр x 
$6. Cn ns Сох ©т,о› Čm, olt): 7 9 ёр En,r(t)- 


57. p, /(0). Por Hh to Но E(), 5, 8,7”. 
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CAPTURE-RECAPTURE ANALYSIS 


| By J. M. HAMMERSLEY 
Lectureship in the Design and Analysis of Scientific Experiment, University of Oxford 


This paper presents a new method for the analysis of capture-recapture data. Previous methods 
have employed deterministie or partially stochastie models of the population under study; the new 
method not only provides a fully stochastic model, but it also allows death-rates to depend upon both 
current time and the health and age of the separate individuals. The numerical analysis associated with 
the method is, however, heavy, and can only be recommended when one wants to extract the maximum 
amount of information from data which have required much effort or time to collect. The method is 
illustrated by an estimation of the death-rate of Alpine Swifts (Ариз melba) in the wild state. Some 
thirty years having been spent in collecting these data, it does not seem disproportionate to have 
Spent several months in computing the results. The larger part of this calculation was done on ordinary 


desk calculators. 
Some unusual problems in maximum-likelihood estimation are discussed. 


INTRODUCTION 


analysis is now extensive.* To acquaint himself 


The | 
he literature upon capture-recapture 
в given at the end of this paper, and then follow 


With j 
^ ith it, the reader may consult the reference 
P the references cited in turn in these papers. To make the present paper fairly self- 


Conta; a ; ; y 
Ontained, however, we begin with a brief outline of the problem. 
nging) size of a population of individuals 


| Тће main object is to estimate the (possibly cha: 
ate. To this end, the experimenter 


| à 8. animals, birds, fish, insects, etc.) in their natural ste 
“ptures individuals from the population on а number of successive occasions. On each 


Осода; eMe ы Р 

ыры: the catch is to be considered as a random sample of individuals from the population; 

lat is to say, each individual in the population (if alive) has an equal chance of being 

Aptured on any given occasion irrespective of its age, health, type, etc., and of any previous 
Јо " 


Ptures it may have suffered, although this chance may vary from one occasion to another. 
ord is marked for it or on it to show the occasion of 


“ach time an individual is caught, a rec : bus А 
Capture; the individual is then returned to the population. Each individual in the 
Population may die at any time, the chance of death being supposed to depend only upon 
9 current time, and the current age, type and condition of the individual. The experi- 

| any given instant from records of the 


е s 
Se мы: has to estimate the size of the population at Ren паа! 
Vera] occasions of capture. The main difficulty is that this estimation depends upon an 


es MA x " р 
а iMate of the effective death-rate then prevailing in the population, and that this latter 


Mate is h. 

ard to make. аў OE 
© experimenter va or may not be able to distinguish between one individual and 
ter. For inst in the case of birds. he can place a numbered ring upon the leg of 
wi piu bers, he will be able to identify any 


Captured bird: and, if all rings bear distinct num 
rt У Б E ud tly recaptured. On the other hand, 
in icular bird upon every occasion that it is subsequently recap 


int on the wing, the colour and 
? case of moths, the marks may be only spots of paint or 5 


Posi; ге but failing to discriminate between 
М lon of d. indicating the dates of capture but tailing А 
| hens m ame history of captures. Merely for convenience 


ада офа whi d thes 
18 wł ave both suffered the i conve 
“Seribing ys ci pes shall suppose that the experimenter has fastened individually 


* 
La; | lm 
` Paes had ideas on the subject 170 years 8 


Anot} 
Cach 


in 
go and the last 60 years have produced a steady flow 
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numbered rings upon all sie пе > ка а — but we shall later see that as 
: i ill be carried through in the other case. | 
* pomi rectis twr the к captures individuals at а sequence of еб 
и Т.,Т,,....Т,; and that the individuals which are captured at some Bee or = 
are named 1,, Jy, ..., Г, by means of their distinct rings numbered 1,2, ..., n. The pe €: 
set of data then takes the form of an m x n matrix in which the element in row i and eo = Б. 
is 1 or 0 according as the individual J, was or was not captured (or recaptured) at the insta 
T;. Call this data matrix D = (d;;),,,. | - 
There are two ways of analysing D. According to the first scheme, which we call ‘ana ж 
by rows’, we first consider апу given subset of rows of D, and count the number of colum ; 
which contain 1 in every row of the subset and zero in every row not belonging to the vae 
Let us call this the z-number of the subset. We then analyse the set of all xz-numbers eu 
by considering every possible subset of rows. This method leads to very considerable al am ^ 
difficulties since the z-numbers so obtained are not independent of each other, and thereto ч 
cannot be combined in a simple fashion to yield the likelihood of D. Analysis by TOWS be 
basically the structure which underlies most previous attacks on the capture-recaptit 
problem; but, either because of the algebraic difficulties just mentioned or because : 
limitations in the data presented to them, previous authors have adopted one or more 
the following simplifying expedients: 


(а) Certain subsets of rows are grouped together according to some rule of pues 
and the analysis is confined to the totals of the o-numbers falling in the groups pe. be 
erated. This pooling of z-numbers may perhaps sacrifice some information. Also it m dua 
dictated to some extent should the available data not give complete records of indiv! 
histories. 


(b) No use is made of the z-numbers corr 
row only: that is to s 


ingle 
р Р igo a sing! 
esponding to subsets which comprise 2 an 
NS H СЕ . 95 
ау, we disregard those individuals which are captured only 009 he 


: -quence Ë 
ecause these special z-numbers и“ nates 
ы er 5 estin 

ciency, on the precision of which esti 


4 pou 
timate. Again this will be dictated if the 8 


orto? 
гыш! : 


(e) It is assumed that; death-rate is inde 
Д pendent of age a bi 
The alternative scheme is ‘analysis by columns’ нне 


(which, for the analysis of the Alpine Swifts, 18 ina з mul» change © 


23 e V! ire 

the data !) sweeps away all the algebraic difficulti раар e 
~ ” 4 50 Й 
and if the comprehensiveness of the data, culties and allows us to avoid, if we 07% 


disadvantage that it leads ( 
considerably heavier computi 


£ than in the ca. 
one or more of the assumptio Sn 


ns (а)-(е). Analysis b 
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for each column of D the likelihood of the observations in that column. Then the likelihood 
of D is the product of these separate likelihoods; for the histories of two individuals are 
mutually independent under the assumptions made in the second paragraph of this intro- 
duction. The most serious restriction in these assumptions is probably that the chance of 
death depends only upon current time and current age, type and condition of the individual; 
for Іп reality the death of one individual may affect the chances of another's survival, as 
In case of over-population. 
One other artifice, which will be fundamental, is the 


THEOREM. Parameter spaces may be transformed before or after maximum-likelihood 
estimation with the same final result in either case. 

This is certainly not an original statement, and it is quite obvious once made. It is, of 
course, false for most forms of estimation other than maximum likelihood. Statisticians, 
to whom I have mentioned it, fall into two classes of roughly equal size: those of one class 

new it from their cradles, those of the other show momentary disbelief. I would have 
belonged to the second class had not this captuce-recapture analysis forced the theorem 
0 my notice. 
A he analysis has thrown up a further aspect of maximum-likelihood estimation. Suppose 
at the likelihood L depends upon a parameter 6 with co-ordinates 0,, 05, ..., 0,. Then as 
è rule the maximum-likelihood solution is obtained as the solution of the simultaneous 
equations 
21100, = 0 (1=1,2,...,9). (1) 
What should one do, however, if no solution exists for these equations, or (worse) if the 


equations themselves do not exist (for example, the likelihood function may not be differen- 
Hable with respect to 0;)? Even the rigorous text-books seem to overlook this possibility; 
nnd yet it may arise in the very simplest of practical problems. For instance, suppose that 
9 Wish to estimate the parameter 7 of à binomial distribution having performed № trials 


о Е x 
Which 7, are ‘successful’. The likelihood function 1s 


ре N л"(1 п)", 
n 


“nd hence 
—N|(1-7) (п=0), 
122 _ | in-Nar) (0<®<М), 
Мјт (n=). 


shen 0<п< М, we get the usual solution т=п]; but when n = 0 or М, и миш“ 
tege = 0 has no solution. In these cases it seems reasonable to take 7 — 0 an a 
ерее уеју; f run te the known information 0&7 < 1, these values of 7 make the 
9 n 9od asl а (blà Tt is easy, however, to construct examples in which no ie 
ал ы ? Е 3 : 
к the желің on as pee e likelihood as large as possible. For instance, consider ЈЕ 
А oing kon ie т Pu sth when n = 0 and we have the prior restriction 0 ENS са 
Bener the f, гаг вж" dur: may be acceptable to some readers, and 1, for one, wou 
қанын а parameter restricted to a known 


EN | 
pn 0 bean unknown vector с ; 1 
AN ? єз ү wae gn ues of Ө. Since the likelihood function satisfies 9 es тА 
бен es — n 4 У d data and Ө belonging to 0. Let e> 0 be prescribed and le 

| Масе 1. — € for the fixed data. Then 


P denote the closure of the set of all біп Ө such that L2 L 
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©(є,) 2 (6) whenever e, > €z; and hence 0(0) = Ld O(c) exists in Ө. It isto be n" = 
©(0) may not have any points in O, it may not be a unique point or even а pc re 
set of points, and that the likelihood for any Ө in ©(0) may be less than ev ery L кайы | 
It is, indeed, possible to construct an example in which the likelihood is not cons aie 
yet, for almost all data, every point of the parameter space is simultaneously a max! ет 
likelihood and а minimum-likelihood solution; for example, it suffices to take L ее 
а positive constant or zero according as 0 is rational or not, except that L is to be sul deni 
modified for one particular value of the data (thus preventing a solution which is gee oid 
of the data). Nevertheless, I should be prepared to define (0) as the maximum-likelih 
solution, mainly on the grounds that it reduces to the 
solution whenever this latter exists, and that in many other cases it seems to produc! йде 
sensible answer. The problem now remains of specifying some expression for the јелене 
of this estimate; but I have по satisfactory idea how to set about this, In the first place ily 
has to define what may be meant by the variance of an uncountable statistic. ee 
even when the statistic is unique, one has to determine what should replace the con vention! 
expression 22 log 120, é0;. Presumably if one can calculate 
У for any unique Ө, then what is wanted 
М is a generalized averaging over O(0) 
M might denote normalized integr 


: 2 ikelihoo 
conventional maximum-likel 
e the 


Pix 
" natrii 
а variance-covarian = pu 
in general is an expression of the type МҮ, sures 
*£ 7, А "zer em 
- Thus, if 000) is measurable and not of zero 1104 


«aly, one 
ation with respect to this measure. Alternatively: © a 
might prefer an averaging defined as the extraction of 4 


number of other possibilities will Suggest themselves 
observe is that the averaging shall be regul 


; (m 

à supremum or infimun tion to 
А ‚ precaut: 

to the reader, The only prec? у 


i в à 2 apping 
ar in the sense that it is an identity mapP je the 
а point estimate Ө; for then the definition of the variance-covariance matrix will inclut 


Н i ; solub 
conventional definition. There is a further point to watch. Suppose equations (1) are pet 
in the ordinary sense except for j = 1. If We now estimate 0, by the foregoing gener" thes? 
nique and substitute this solution into the remaining equations, then some or all of 


the 
ELLA au shen 

remaining equations may no longer be soluble, We may expect this to happen W he 1 we 
estimate lies on the frontier of parameter space: and y i 


"M 
à ve shall have an instance of it Y 
come to consider the data for the Alpine Swifts, | 


^ 


A STOCHASTIC MODEL F 
Let p; = 1—q; denote the prob 
occasion Т, is captured ( 
ability is constant for all 


OR CAPTURE-RECAPTURE 


ANALYSIS 
ability that a 


‚үе oP 
| alive 
dividual IL, known to ven p ob 
n TEG occasion, Since (b hypothesis) ММУ гә 
individuals at risk of Capture, p, is an m zi dm ехрегіте! (pe 
asion T^. Let фу " 


quy bha” 
= 1— y denot 'obability t°? ote 
1 1, қ enote the proba n 
al J; is alive on the occasion Т, |, given that it ig alive ^ I Т. Let His T ел 
the probability that the individual 1, is recaught n occasion 7). i 
that it is captured ( S нер 


D " е 
оп occasion 7 me occasion subsequent қ Toro 
sion 7., T 
i: Then we have the fundamental @ 
Pig = ( 


т = 0 holds because the 


2 «her C% os 
т- For a proof of th experimenter makes no furth kn 


ы | е "e^ 

П Occasion 7, Then , difference equation itself, suppos? ., js E. 
caught, it must certainly Survive inii] ime We know it is alive on occasion Ty- 70 
ability Фа; and the Ioni 


о i8 P o 
7+1 àt least, of which event there n ali 


5 3 ads tP 
n Occasion ТА given that it 18 
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pa probability р; 1), or it must escape capture on occasion T,,, (with probability а, 3) 
к teat be captured on some subsequent occasion given that it is alive at T;., (and the 
, 7 ability of this is /4:41,3): The ‘either-or’ of the previous sentence is exclusive; and (2) 
ollows at once. Now from (2) | 
фи = Óy(Di t i32 Pig Pisa t dada) = Hiz 
and 
Haz = Фуа, + 9gPial ад) 2 ultus 
I ; ; 
UN the various probabilities lie between 0 and 1 inclusive. By putting p;,, = 1 and 0 
Spectively in these inequalities we deduce that 
| $5237 Gli (3) 
18 i six : В 
bis best possible set of inequalities operating upon the /'5. Hence (3) defines the para- 
и ег space u, The parameter space ф is evidently the closed unit hypercube. From (2) 
ve also have 


Hiyj7 Фа = Фл, 1 + Gilles) = Фа, 7 1 15); 


and 5 Фф һу ki-i (j= 
ш $5 = — jos E sees tit (4) 
Also Pi фолу Фе _ Piu — Hi) - (филь) 
qi [n inj Инь 
Кај“ бан @ = 2,3 т) 
321437 ш. = Бұд 2); 5 
Фе, 14-13 (5) 


"Ippose 1, is captured on occasions ЛІ, ...,Z and on no other occasions, and that we 


Consi а РАДА а 
sider only information upon J; on occasions from 7, onwards. The likelihood is then 


92 Чат 40 (+ _ 
Th Lj = udi бор Фа Ва о Ро 4.05 --- do (1-). (6) 
? likelihood of D will now be Jis И И (7) 


We wish to take into account the possibility that J; has remained uncaught in the popula- 


N from some unknown instant of entry Ж()), then Z; must be multiplied by 


фид PROS фл а, кодкома ++ lami 
J) is a parameter to be estimated (or à random variable to be integrated out). 
© maximum-likelihood solution results from choosing, subject to (3), the available 
mm meters so that L is as large as possible. Before we can do this, however, we must by 
шо assumption reduce the number of unknown parameters; for at present there are more 
Ca, 9Wn parameters than observations (ie. elements of D), and hence the parameters 
the. be uniquely determined. The analysis will, moreover, not be tractable from the 
“tical point of view unless we keep the umber of unknown parameters fairly small 
assume that all individuals were equally healthy. At 
ience, we may employ the theorem stated in the 
ansform the parameter space before maximizing 
ansformed space, we can transform 


in Whice 
h А 
ты 


(вау п 
the g 658 than 100). Thus we might 
WE ао time, also for numerical conven 
\ (7) “ction, According to this we may tri 

n : 1 in this tr: 
the then, after estimates have been found in с 
? int : his technique in the illustrations on Alpine 
Зу АСК о the original space. We shall adopt this q 


^v p-gpace into |-врасе. Another technique worth 
боны: y employi 4) and (5) to carry p-space in | à 
Sidering i ng (4) r mber of parameters, by the introduction of 

‚ “ting is the deliberate use of an excess number ot p: : 


Опер ы 40 
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new parameters which depend in a functionally determinate manner upon the minimal = 
of parameters assumed in formulating the model. In the augmented parameter space 
minimal set will be some hypersurface. We may then maximize (7) upon this hypersurface 
by the use of Lagrangian undetermined multipliers. This will prove a convenient procedure 
when it is easier to work with many simple equations than with a smaller number of com- 
plicated ones. 

The parameters which are sought are the ¢’s and the p’s. The former specify death-rate, 
and the latter lead immediately to estimates of population size. 


AN EXPERIMENT WITH ALPINE SWIFTS 


We shall analyse some data collected upon two colonies of Alpine Swifts (Apus melba) a 
Solothurn in Switzerland from 1920 to 1950. 

A bird aged x years is called a nestling if v< 1, young if 1 <x « 2, and adult if v > 2. An 
ornithologist can tell the difference between these three categories upon inspection. Alp y 
Swifts migrate annually. If a bird returns to the place of catching when it is adult, it ' 


р у А е 
ornithologically reasonable to suppose that it will thereafter continue to return to this sam 


place each year until it dies. This does not necessarily hold for nestlings or young. Wes 


ets ers. 
ts leg. All rings bear distinct numb 
g; and it is convenient, in view 9, 
: +d is ваї 
apturing young. A bird is Вг 


of Pan ar. This s 
simpler assumption than that of the previous а ниш. | н нант supp? ; 
tion in view of the fact that most birds die due to natural hazards rather than through ? 

orological reasons amongst a ird 
etical one referring to an ‘average к 
the word ‘average’ are defined РУ 

that is to say, the death-rate for in 

ken rather than, say, the mea? ue 46 
co ф no longer depends upon а SU 50” 
y that an adult bird (known to be m in year i) 28 w^ 


‚ for mete 


a anomalous behavio i 
7 for the probability that a nestling (ringed at birth) bs = n үөү Р р. 4 
catching two years later. Define w = пјф “Роа 

We can now write down the probabiliti = 

нета J es for the various i sing 
[1] Bird ringed in the unsuppressed records as an adult in чөө Sai ооо s 8) 
iu 
L= | 
и = (1—0), 
[2] Bird ringed in the unsuppressed 4) 
: rec 

о, P, ...,0 and in no other years: ords as an adult in year i and recaught - 


cdi 9) 
L; = фе M " ( 
i i è = «Ч в... ШЫЛЫ 
[3] Bird ringed as a nestling in year i—] and cA 5 ат со 
І,-(1-л)-т Tecaught in the unsuppresse 
Фа1-д, yu 

alka) = 1-99 фор E ) (10 

by virtue of (2). =] | 141 — ilia 
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[4] Bird ringed as a nestling іп уеаг 2-1 and in the unsuppressed records recaught in 
Years с, fj, ...,0 and in no other years: 


L; = expression (9) multiplied by w. (11) 


е [5] Bird found dead in year 0: in expression (9) we must replace ¢(1—jg) by у. Hence it 
18 enough to introduce the factor 


1-¢ 
a 4 (12 
(l — о) ! 
[6] Bird killed artificially for experimental purposes in year 0: introduce the factor 
1 
s (13) 
1-/% 


То simplify the ensuing numerical analysis we now transform from the parameter space 
Фп, p) to the space ($, w, ). This is justified by the theorem quoted in the introduction 
and effected by substituting (4) and (5) into (9) to yield 


7 сор Faden s (1 — ш) 


да Чр +++ do 
= (а (ера) (вета обрве) Gu 
= ви) (— ны) (= іше) Е ЕЕ мы) (st) ( => x ( PD 


Upon noting that и = 0 and collecting up expressions (8)-(14) inclusive, we find for the 
n 


likey: 
Kelihood of the whole set of data D, 


L= 04 GAT (a — wpt;)Pi (1 =) (Ф = пр“ (p; — Gliz)” (15) 


Whew 
ere in the unsuppressed records 
4 = number of nestlings which are recaptured; ; tured; 
i = number of nestlings ringed in уеат?—1 which are never recaptured; 


e, ~ (number of adults ringed in year jan 


— (number of adults ringed panem ‚4—1 and recaptured 
— (number of nestlings ringed before year ? — 


i кі "tificially ог recaptured dead in year i); 
CUM - of birds killed artificia 
i= и as i 5 which, having been ringed either as adults before и ү 1 ар s 
er ; 1 
young abro sali $, were recaptured som! after year i+ 1 but which wer 


recaptured in year i+ l; | 

155 Dumber of birds recaptured in year t+ 1 | M 
7 number of birds found dead but not artificially killed. A 

assumptions, it is not necessary to know individual 


d never recaptured) , 
and recaptured after year i) 
after year 2) 


D 


etime 


Шаш, : 
hig Ч thus seen that, under suitable 
tories, у 18-2 
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Let us renumber the suffices 7 so that they run over the values aun p eri ma ша 

sponding to the years 1920—50 inclusive. The data then yields A = 205, 
i B, с, D; Е; 
20 0 фл 2 Н 
21 17 0 2 0 
22 18 - 2 6 1 
23 0 = 5 4 4 
24 12 0 7 7 
25 33 + 2 3 7 
26 25 — 5 11 0 
27 12 0 11 0 
28 35 - п 17 0 
29 25 — 17 17 7 
30 0 — 21 15 7 
31 27 — 17 10 18 
32 24 — 2 18 18 
33 42 — 29 28 25 
34 60 - M 31 28 | (16) 
35 82 — 51 34 42 
36 100 — 51 26 58 
37 129 — 15 44 16 
38 103 — 41 51 9 
39 82 — bl 57 3 
40 129 — 58 47 17 
41 178 — 57 41 21 
42 125 - 53 36 37 
43 175 - 44 60 23 
44 113 — 65 51 33 
45 132 — 66 60 43 
46 155 - 90 61 61 
47 189 - 96 59 81 
48 216 — 103 42 87 
49 3 — 58 0 84 
50 328 + 18 0 0 


The summary (16) shows that the terms іп Z involving ji, are 


(1 — Изо) ($ — Изо). 
To maximize L we must therefore minimize Изо. According to (3) we get 
#20 = физ. m 


15) and pick out the terms in Из 


(1 Tha)" (p А1) (1— Plia) ($ — pha)? 
Once again use (3) to obtain from (17) 


Now substitute (17) into ( 


Изо = фия = фи. (18) 
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а ЗИ 
ғ We can now eliminate дь and д: from (15). The terms in 45 become 
| ф%1— фор)“ (1 — изз)! ($ — и) (1 — Филе)? (1 — ФИ) (Изо — фигз). (19) 

The analysis may proceed in a straightforward manner until we reach the next run of zeros 
in Z;. Treating these in а similar fashion we find 

Hog = физ = Фф» = Фад. 

Upon elimination of /tas; #27 and /4 from (15), the stage is set for logarithmic differentiation, 
and the estimation follows from the simultaneous solution, subject to (3), of 


(20) 


E; _ wB; РЕ с, " D; sf ФЕ; 1 ww р қ 
ш-дша 1-Өш 1—/% -hi И Фа e ‚24, 30, 31, ..., 48,49), (21) 
1 18 176 6 29 e 
— + + + + 
Ша- $ls 1— flee 1— фор» Ф Ios 1— физ 1— Физ (22) 
INNER жне =: 79 А 
Поз — Plog 1— ФИ а Под Физ’ e 
eee ee іш | _ 256% 
[tag — kso — 1— ФИ 1-фой» 1— Фо» 1— Фор 
17 11¢2 5д% тө 
+ -т + , 
ý 9-і» 1-ФИә 1- Plog Hos — Ф Иго (24) 
Изо = 0, (25) 
9108 [. 205 ,— Bili 
02 0z — = + 
до w E 1— ој; 
17ф э» _ ЕЛЕҢ 33 
7 = фор; 1— Ига 1— оре 
ЕР 2509 _ 120 — З5физу _ 259 (26) 
І фоизу 1— Фо І фор 1— ©з’ - 
alogL _ su| 2e _ нна om. 
ee дф 4 (6-4 I7 Фит 9 1-9 
17029 ү m Sus _ 2физз 
1 — фор ф- Ио 1 = физ 1— Фа 
3. 236 о _ аз 35 у 
*$- js ers Изв — ФИзз 1 — фоизу 
/ y Es 24доИә _ Пас. E: 17 
ы Г фору 1-99 9-0» 
22D flog + 15ф Иду (зо (27) 


71 фу 1- PHa KaT фИзо 
ation over i = 23, 24, 30, 31, ..., 48,49. The 


and (27), Zi denotes summ 
i 
s equations will be discussed in the 


In equations (26) 
method of solving the foregoing set of 28 simultaneou 
next section. 

The next stage is th 

($, w, р). As already men 

| co-ordinates on the fronti 
regarded (18) and (20) as algebr 

Ра 


the inverse variance-covariance matrix for 
uction, it is not clear what one should do with 
default of a better procedure, Т һауе 


е determination of 
tioned in the introd 
er of the parameter space. In 


aic identities; and thus taken 


‘роз = Por = Pos = Ре = 0 


2 
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as quantities not subject to sampling error. It then remains to calculate 


TE ia) (i,j = 22,23, 24, 25, 29,30, ...,48, 49); 
9 ди; VL Op 


ди; ди; \L Qo) 


9 (10L _ д (10L - x (52) 
= aa (Te , %% = 26 15%)? ^w = Zo (Lw) 


The algebraic expressions for these quantities are lengthy but quite easily obtained from 2, 
and therefore will not be printed here. The only non-vanishing c’s are Си» Cogs C. 
Cog = ба» Ci i+1 5 Citi, i Cio = Сур Cig = бш. 

If we write 


с д (29). Cio s (290) (i = 22, 23, 24, 25, 29, 30, ..., 48, 49); 


‘ww? 


(i,j = 22, 23, 24, 25, 29, 30, ..., 49); 


Coj i баш Cug 

%) 1 Фа 0% 
then —C-! is the variance-covariance matrix of the estimates in the parameter space 
(в, ©, ф). To transform to the (р, о, $) space we take a first variation of (4) and obtain 


1 (ф — Hi) Iti 
да; = — бы + ди od, 
ЫИ ЕЗЕШ а Ba ay? 


with the exception 


945 = 


1 Фф — Mos) I 3d flog (P -H ) 
= би. + à 25 — 29 25/ gy, 
Ф(1 — фи) dis (1— фи) Mint Pl- pha) (1— Физ)? $ 
Consider now the 26 x 27 matrix U whose rows а; 


nd columns are indexed according to the 
following scheme: 


22 23 24 25 i 20: 30:48 40 i о ф) 
23 x x | | T 
24 m X m 
25 x x я 


(28) 


£- 


~ 
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and whose elements vanish except in the last column and on the leading diagonal and super- 
diagonal. These remaining elements are specified by > 1 


Шай 1/o9(1 — 4) (i = 22, 23, 24, 29, 30, ..., 47, 48); 
ug = (i)b) 
и, = ша/9%1-ш) 
26,5 = — 160 – Фиљо); 
изо = #%(ф—/ь)/(1— Ф) 


Missus Ex Hos __ Зфизь(ф — #25). 
28 T ф— и) (1 = Play)?’ 


= 1; поду = Mog = Ugu = 0. 


} (i = 23, 24, 25, 30, 31, ..., 48, 49); 


и = Ugg 


Then the variance-covariance matrix for estimates in the (р, о, 9) space is —UCAJU',, 
whose rows and columns are indexed in the same fashion as the rows of U in (28). 
'T'he estimates of p themselves follow from (4). 


NUMERICAL ANALYSIS 
The main numerical labour lies in the solution of the 28 simultaneous equations (21)-(27) 


inclusive. This was done by iteration as follows: 
(i) Choose trial values of ф, 0 and дь. Then use equations (21)-(24) inclusive to calculate 
Hgo «+3 Над 502 this last value of изо coming from (21) with 


ations will be called a minor cycle. The final value of Изо 80 
obtained should be zero to agree with (25); but in fact it will not be, due to the inexact trial 
choice of ду. So keeping ф, о both fixed, adjust the trial value of ә and begin a new minor 
cycle. Repeat the minor cycles with successive adjustments of аа until the resulting value 


of jg satisfies (25). 


in succession /l23: //4: 5) #29 
i = 49, This sequence of calcul 


Then 
(ii) use equations (26) and (27) to calculate О and Ф. The work so far covered in (i) and 
(ii) will be called a major cycle. 
ior cycles with fresh trial values of ¢, w. We now have three pairs 


(iii) Do two more maj 
of values of О, Ф. Carry out a bivari 
values of ¢, w such that О = Ф = 0. 

(iv) Finally, repeat the major cycles, 
(26) and (27) are satisfied. 

In carrying out the minor сус 


ate inverse interpolation in the ( О, Ф)-ріапе to estimate 


with successive approximations upon ф, ө until 
. Thus any minor cycle can be 
stopped, when only partly completed, as soon as (3) is violated by a poor choice of әз. 
This helps to shorten the work considerably. Increasing Из; increases all the other д’з for 

1 the other /’s. The latter values of д ате extremely 


fixed ф, w; and decreasing #22 decreases а. 
ч ин im small : in the final solution a change of 10-19 in jj, gave à change of 


changes in әз: imn 1 
4х10-5іпш Similarly, in the major cycles О and Ф were very sensitive to changes in 450. 
On that ace a + the work was с ficant figures throughout, with 


arried out retaining 10 signi 
a number of interpolating refinements built into the final stages. After the final major cycle, 
the computing errors in ¢ and о Wer 


Jes we have to observe (3) 


е determined variationally and found to be 4х 10-5 
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and 1 x 10-5 respectively. This leaves sufficient margin in view of the standard errors of 
th uantities. | 
Au sis foregoing work was done on ordinary desk calculators. A minor cycle could be 
i ; j in about 10 days. 
completed in about a day; and a major cycle in а | : 
The 27 x 27 matrix C was evaluated on a desk calculator. C was inverted on SEAC. This 
required a couple of days for coding, followed by about 4hr. machine time. In view of the 
large proportion of zeros in C, however, this inversion could probably have been accom- 
plished with about two months’ work ona desk calculator. Finally, the standard errors were 
calculated from the diagonal elements of — UC-!U' on a desk calculator. 


NUMERICAL RESULTS 


The final major cycle employed the values 


ф = 0:82177 252, w = 0-15231 687. 


i в 

92 0-49114 70078 
23 0-55283 78505 
24 0-48551 39880 
25 0-39790 15014 
29 0-57753 15026 
30 0-62797 49077 
31 0-68978 94273 
32 0-65635 69119 
33 0-64690 45524. 
34 0-63783 00155 
35 0-63643 78806 
36 0-57590 81349 
37 0-37055 55369 
38 0-37884 51461 
39 0-42295 19018 
40 0-50268 84610 
41 0-54322 89445 
42 0-57591 99689 
43 0-54504 16713 
44 059087 04335 
45 0-61829 34809 
46 0-64147 85385 
47 0-63248 97872 
48 0-57021 21461 
49 0-40731 92325 
50 0-00000 00000 


these values being interpolated between the independent minor cycles 


Бәз = 0-49114 70079 leading to Изо = + 0:00002 70661, 

H22 = 0:49114 70078 leading to Изо = — 0:00001 02407. 
For a reason to be noticed presently, 
is an X-bird if and only if it is a nes 
adult when ringed. Since ringing, 
these two classes are mutually exc 


we classify all birds as either X-birds or Y-birds. A bird 
tling when ringed, and is a Y-bird if and only if it is ап 
by definition, refers only to the unsuppressed records; 
lusive and exhaustive, The following table summarizes 


ж 
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for each year the number of birds captured, together with estimates of the experimenter’s 
catching efficiency and of the current population size: 


Number of birds caught 
Year Catching efficiency Population size 
(р; £standard error) | ( + standard error) 
X-birds Y-birds All birds 
= — = |- — —— P5. 
1920 0 3 3 — per 
1921 17 2 19 0 | == 
1922 23 0 23 0 == 
1923 2 2 4 0-100 + 0-096 40 +38 
1924 19 11 30 0-364 + 0-154 89 + 35 
1925 36 16 52 0-320 + 0-105 162 + 53 
1926 32 6 38 0:241 + 0:084 158 + 55 
1927 12 11 23 0 = 
1928 37 4 41 0 — 
1929 33 0 33 0 == 
1930 7 3 10 0-201 + 0-071 50 + 18 
1931 38 7 45 0-240 + 0-082 188 + 64 
1932 38 17 55 0-533 + 0-094 103x 18 
1933 | 64 20 84 0-430 + 0-080 195 + 36 
1934 82 25 107 0-412 + 0-065 260 + 41 
1935 113 22 135 0-384 + 0-060 351 + 54 
1936 141 35 176 0-468 + 0-056 376 x 45 
1937 170 64 234 0-525 + 0-052 446 +45 
1938 121 17 138 0-116 + 0-028 1190 + 285 
1939 91 9 100 0-066 + 0-022 1515 + 494 
1940 136 2 138 0-024 + 0-014 5726 + 3279 
1941 202 8 210 0-150 + 0-035 1401 + 326 
1942 143 25 168 0-201 + 0-041 837 + 173 
1943 213 42 255 0-342 + 0-050 745 + 109 
1944 139 16 155 0-177 + 0-035 876 + 172 
1945 172 30 202 0-264 + 0-042 765 + 121 
1946 191 39 230 0-309 + 0-041 743 + 100 
1947 236 60 296 0-403 + 0-043 734+ 78 
1948 971 58 329 0-464 + 0-041 | 709 + 63 
1949 47 72 119 0-484 + 0-043 246 + 22 
1950 285 55 340 -- же 


Тһе expectation of life of a bird is 1/(1—¢) years, and this, together with its standard 


error, is 5:61 + 0-30 years. 
The probability that а bird will survive 2 years from birth and then return to its place of 


birth is фо, and this, together with its standard error, is 0:1252 + 0-0088. Thus about one 
nestling in eight returns two years after birth. 


CoMMENT ON THE DATA AND THEIR NUMERICAL RESULTS 


by H. Arn of Solothurn, who was primarily interested in quite 
for which his data have now been used. In particular, because 
of his other interests, he made a deliberate attempt in some years to catch unringed adults 

result the catching is not 


and in other years to ring as many nestlings as possible. As a 
random, the various pirds at risk have not always had equal chances of capture, and the 
баанан] assumption of the present capture-recapture method is not satisfied. The table 


above shows evidence of thislack ofrandom catching. Forexample, in 1949 the experimenter 


The data were collected 
different problems to those 
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appears to have concentrated on capturing adult birds; and in this year he has secured тч 
unusually large number of Y-birds in a fairly small total catch. As a result in 1949 his 
catching efficiency (based upon the false assumption that he had been catching at random) 
is artificially inflated, and therefore the estimate of population size is made much too low. 
The reverse effect takes place in the years 1939, 1940 and 1941 when he has evidently con- 
centrated on ringing nestlings. 

It was nevertheless decided to analyse the data as though they had been based upon 
random catching, because considerable ornithological interest attaches to the age of 
Alpine Swifts in that they enjoy an unusually long expectation of life in the wild state, 
and because the present data is quite unique in its extensiveness. 
random catching is not here valid, the numerical results must be 
deal of reserve. The individual year-by-year population sizes carry very little weight, 
although the general tendency of the size of the population to increase with time is borne 
out by other evidence not presented here. The probability of 1 |8 for the return of a nestling 
two years later is also distinctly suspect. On the other hand, the estim 
life is in good accord with estimates obtained from other less extensiv: 
less liable to be incorrect although its standard error may well be wrong. 

The fact that the numerical results are for these reasons suspect will concern the ornitho- 
logist: but the main object of the present paper is to provide a theoretical analysis of the 


general capture-recapture problem, and to show that the analysis can in fact be handled 
numerically. This has been demonstrated. 


Since the assumption of 
interpreted with a good 


ate of expectation of 
е experiments, and is 


ns on the subject of 
Prof. В. A. Fisher, Mr P. H. Leslie, and 
debted to the referee for useful commenta; 


capture-recapture analysis with Dr D. 
“Prof. P. A. P. Moran. Finally, I am in 
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THE USE OF CHAIN-BINOMIALS WITH A VARIABLE 
CHANCE OF INFECTION FOR THE ANALYSIS 
OF INTRA-HOUSEHOLD EPIDEMICS 


By NORMAN T. J. BAILEY 
Nuffield Lodge, Regent’s Park, London 


1. INTRODUCTION 


An important characteristic of highly infectious diseases is that they tend to produce groups 
of cases occurring within households rather than separate cases scattered throughout the 
sehold were due to some single source of infection such 


community. If all the cases in a hou 
as typhoid-infected water, then we should expect the total number of cases to follow a 
binomial distribution. On the other hand, if the disease were introduced into the household 


by one of its members and then transmitted from person to person, a very different situation 
would arise. For a disease involving а short period of high infectivity and an approximately 
constant incubation period, we should expect to be able to distinguish different generations 
of the intra-household epidemic. There might be a binomial distribution of secondary cases 
resulting from contact with the primary case, to be followed later by another binomial 
distribution of tertiary cases amongst the susceptibles who had previously escaped, and 
so on. In his classic paper on this subject Greenwood (1931) introduced such a chain- 
binomial model for the investigation of measles epidemics, for which the assumptions made 
are thought to be approximately true. When the period of infectivity is more extended, as 
with scarlet fever or whooping cough for example, the stochastic model recently discussed 
by Bailey (1953) may be more appropriate. Greenwood examined data on the 1926 measles 
epidemic in St Pancras, and showed that, so far as the total number of cases in a household 
was concerned, the hypothesis of a simple binomial distribution was quite inadequate, 
while the distribution expected from the chain-binomial model gave a satisfactory fit to 
the observations. Itis important to notice, however, that this material provided information 
only on the total size of the epidemic in a given household; no analysis of the individual 

Bennett, Allen & Worcester (1939), in their 


links of the chain was possible. Now Wilson, r ) 
investigation of cases of measles occurring in Providence, Rhode Island, during 1929-34, 
were able to go a stage further and break down the data into its constituent parts. Of course 


it must be recognized that departures from a constant incubation period, or the possibility 
of multiple primary cases, give rise to certain difficulties in the so-called chaining of this 
kind of material. However; it seemed likely that in the present case this source of confusion 
would not be serious. Wilson et al. were able to show that їп none of the groups of available 
data did the Greenwood model give an adequate deseription of the numbers of cases 
е separate generations through which the epidemic passed, though it did 
ory fit to the distribution of the total number of cases in а house- 
(1947) considered a somewhat different approach. According 
two susceptibles in addition to the first case, for example, 
ance of one susceptible being attacked and the chance 


occurring in th 
sometimes give а satisfact 
hold. In a later paper; Wilson 
to this, in families containing 
one could estimate separately the ch 
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that the other would be attacked given that the first had already become infected. This 
principle can be extended to families of any size, though it has the appreciable жу 
thatitisnot clearly related to the epidemiological process known to be taking place. A further 
discussion of these problems was undertaken by Greenwood (1949). He made the important 
observation that the failure of the binomial distribution to represent the distribution of 
cases at stages in the chain where a goodness-of-fit is possible could be due to variations 
between households in the average chance of infection. Such a state of affairs might well 
be expected to result from natural variations in both heredity and environment. This remark 
of Greenwood’s is taken as the starting point for the present paper, in w 
a satisfactory mathematical model can be obtained, at le 
data available, by retaining the hypothesis of a ch 
with the additional assumption that the aver 
households according to a suit 


instead of only one as previously. Another important modification of detail in Greenwood's 
original scheme has lately been suggested by Lidwell & Sommerville 
(1931) assumed that the chance of an 
number of infe 


hich it is shown that 
ast for the small amount of suitable 
ain of binomial distributions, together 
age chance of infection varies between different 


(1951). Greenwood 
y susceptible being infected was independent of the 
ctious persons to whom he is exposed. Lidwell & Sommerville, however, 
considered the alternative assumption that the risk of infection for any susceptible is the 
same with respect to each infectious individual. This of course affects only families with 
more than two susceptibles in addition to the primary case. In their analysis of the dis- 
tributions of the total number of cases of the common coldin households, Lidwell & Sommer- 
ville showed that on the whole the Greenwood model was inadequate whereas their own 
modified scheme fitted quite well. For a useful discussion of the e 


of the problem of estimating infectiousness Hope Simpson’s 
consulted. 


pidemiological aspects 
(1952) recent paper may be 


olds of three individuals, contair 


ning two susceptibles besides 
У suppose that the probability of a 


able f-distribution. This involves fitting two parameters ` 
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children under 7 months or over 10 years old. Moreover, some of their tables are really 
relevant to families containing more than three children. We have decided for the present 
purpose to confine ourselves to the data on households of three, containing just two further 
susceptibles, apart from the primary case, at risk between the ages of 7 months and 10 
years. This seemed to be the most reasonable way of selecting what might be expected to 


Table 1. Greenwood chain for households of three 


Expected Observed Providence i 
Type of chain ~ of no. of measles data F xem 
` households households (see text) ханив 
->0 пф? а 34 14:9 
1 ка 1-0 2npq* b 25 23-5 
111-0 2np'q с 36 87-7 
152320 np? d 239 207-9 
Total n n 334 334-0 


be a fairly homogeneous group of data. In analysing their data Wilson et al. estimated the 
parameter p by equating the expected size of the total epidemic to the observed value. This 
is not necessarily very efficient. They then examined the fit to the four classes shown in 
Table 1. A more satisfactory procedure is to estimate p by maximum likelihood using all 
four classes shown in the table. It is easy to see that this efficient estimate is 

ф = (b+ 20+ 2d)/(2a+ 3 + 3 + 2d), (1) 
with large sample variance var 9 = pg[2n(1 4 pq). (2) 
o the data shown in Table 1, we find p — 0-789. The corresponding 
expectations appear in the right-hand column. The goodness-of-fit y? is 59-8, which for 
2 eas of freedom gives P « 0:001. Thus the Greenwood model, which for groups of three 

© 


i ‘fication suggested by Lidwell & Sommerville, is quite inappropriate 
en 2 n dh actual observations with their expectations in Table 1 shows 
that there are more families than expected, in which the epidemic either does x та 
the primary case or else involves the remaining two susceptibles ciclo à E А 
in with Greenwood's (1949) suggestion that the chance of infection ed vary жы ro 
to family. Suppose we now assume that this source of variation in p between households 


is represented by the f-distribution 


Applying formula (1) + 


1 2-1(1 9-14 0<р<1 (3) 
ТР — рут ар, (0<р<1). 
aF = Bey)" 
с і kind of chain over all households 
i average the expectations for each Love 
dis eu cp pierde of ж and y. As the individual frequencies in Greenwood or 
Ма не wen е be represented in terms of quantities like p'q*, it is convenient 
idwell chains са: 3 . a 
to work out the expectation of this quantity. We have 
Бр = [ера dp| Bv, y) 
0 


= Ва +r, у +в)/В(®, y). 
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Therefore 
Eptgs = 2(2 +1) ... (2 +7— 1) (9 +1)... (y+s— 1) (2 + у) (zy +1)... (:+у+"+8—1). (4) 


In particular 
Ep = (+1)... (2 +7— 1 (2 +) (2 фу +1)... (2 +у+7—1), (5) 


and Eg = y(y +1)... (у+8-— V) +у) (2 +у+1)...(2+у4+8—1). (6) 


ТЕ we apply these results to the simple expectations shown in Table 1 we obtain the derived 
set of values set out in Table 2, which we shall proceed to analyse by the familiar technique 
of maximum-likelihood scoring. The maximum-likelihood scores for т and y are clearly 


8 b+c+d c+ 4) т т b+c 
„= + - + | 
2 2+1 2+9 ®+у+1 x+y+2 (7) 
at+b+e a+b ( т т 5--с | 
iS, = - + 
апа 2 | y "РЕЈ =+у EI 
Table 2. Modified chain for household of three with variable p 
Observed Providence 
"Type of chain Expected no. of households no. of measles Fitted 
ў households data values 
(seo text) 
1-0 ny(y +1)/(-+Е у) (+y +1) 4 > à 
11-0 2nzy(y + (2 + y) (zy + 1) (e+y+2) b > 34-9 
1.513150 гпуг(2 + Y)/ (2 +y) (x+y + 1) (+y 4-2) 6 an 22-7 
15220 пао + 1) [(2 + у) (2 4- y 4- 1) d 8 6 87-6 
Total n 39 238-8 
334 334-0 
The expected amounts of information are 
m s n(x + 2y 4-1) па: + By 9) 
= ay) yl) еее: =й 
VEM (аут) 7, 
„= 
(2a-+y +1) в. 
Ly = = ny( t+Yy +2) 
Й 8 
ООУ Бета eg, и 
т 


n 
= ЕГЕ пау 

(2+0) Е уау) | 
NAR 
In many cases it will be easier to use the ' 
derived from the three different expressions in bracket; ме of Infor. 
simply by replacing the denominators by their Squares : и. up 
| An easy way to obtain two rough estimates оға к | А 
is to amalgamate the second and third classes їп Т, Y to commence thei А 
to their expectations. This gives able 2, and + е iterative process 


e 
п equate the observations 


mation, which can be 
the two scores in (7) 


t= (b+c) (5--с--94 
ала За 


(b Не) 


Ý = (b+c) (2240 | 
а + У Над (4 суа | (9) 


y" 
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If we now use these results on the Providence measles data, we obtain first the initial 
estimates from equation (9), giving 
d = 1142 and ў = 0-273, 


for which values the scores are S, = — 1:1253 апа S, = +2-1255. The information and 


covariance matrices are respectively 
6-2401 +1:3151 
| (10) 


63-8027 — 226-696 
Г = = 1078 
u + ссе and (уј=10 | 5 0-3701 


The adjustments to be made to 2 and jj are therefore — 0-042 and — 0-007 giving 1-100 and 
0-266, respectively. A further stage of iteration makes a slight change to 

2, = 1:092% 0-250 and 93 = 0264 + 0-061, (11) 
Where we have followed the estimates by their standard errors derived from the covariance 
matrix, and the subscripts refer to the size of the household. The expected values are given 
in the right-hand column of Table 2. The fit to the data is now very good indeed, the x? with 
one degree of freedom being about 0-32. It is interesting to observe that since y is less than 
unity the -distribution is J-shaped with an infinite ordinate at p = 1. This is in keeping 
with the excess of families observed in Table 1, in which both susceptibles succumbed 


‚ immediately to infection by the primary case. 


3. ANALYSIS OF HOUSEHOLDS WITH FOUR INDIVIDUALS 


When we come on to households with a total of four susceptibles, i.e. three in addition to 
the primary case, then the modification introduced by Lidwell & Sommerville begins to 
have an effect. It influences the tertiary distribution, in which, when there are exactly two 
secondary cases, the values 0 and 1 will occur with probabilities q? and 1 — g* instead of 
4 and p. Let us again use the Providence measles data, confining ourselves to households 

er susceptibles, apart from the primary cases, at risk 


of four, containing just three furth 
between the ages of 7 months and 10 years. The appropriate expectations and observations 


are all collected together in Table 3. The maximum likelihood estimate for the Greenwood 


model is plainly 
p= (b+ 20+ 2d+3¢ + 31 +30 +3 


od and Lidwell chains for households of four 


За +5b + 6c + 44 + 6e + 5f + 4g + 3h). (12) 


Table 3. Greenwo 
Ехресбей no. of households | Observed | -Provi Fitted values 
Expecte mf Provi- 
s dence 
y house- 
T f chain a data 
ype of спа E E Lidwell holds Ж Greenwood |, Lidwell 
4 0-9 1:2 
ng ng? a 
i kr зпра“ Зпрд\ b 3 0-4 0-7 
1511-0 6np*q* m 5 i a 55 
3npq* np*q ie me 
ee | | Be |» || T: 
302 np*q 2 Е 
е 52150 ША 3np*q(1 4) g 10 310 38-7 
->1-0 5 np? h 67 49-6 45-5 
| 15850 е? в n 100 100-0 100-0 
офа; 
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Substituting the Providence data gives p = 0-791, and the expected numbers are as shown 
in the table. The goodness-of-fit test, amalgamating the first four classes and then the next 
two, gives Р < 0-001. To estimate р on the Lidwell model we have to solve 


b 2c + 2d + Зе + 3f + 29 + 31. За + 46 + 4с + За + Зе + 2 -+-д 240 = 
5, 1—4? 
p q 
Taking a few trial values and then interpolating inversely soon gives the required root. 


We find, for the data of Table 3, that p = 0-769. Again a very poor fit is obtained with 
P<0-001. 
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(13) 


Table 4. Modified Greenwood chain for households of four with variable p 


Observed Provi- | Fitted 
Type of chain Expected no. of households he = 5 dence eet ae 
holds | data 
1-0 nyly + 1) (+ 2) (2 +)... (2+у 4-2) а 4 49 
11-0 Snay(y + 1) (y 9) (у + 3) (2 +) ... (x + 4-4) b 3 26 
1511-0 Gna(z + 1) y(y 1) (y - 2) (y + 3)(e+y)... (+y 4- 5) c 1 1-9 
1+2+0 Sna(z + 1) y(y + 1)/( +y)... (04-y +3) d 8 5:0 
1-1->1--1-0 Gna(z- 1) (+ 2) (+ 1) (у 2)/G-- y) ... (x+y +5) e 4 241 
1--1-2->0 Зпа(а + 1) (2 + 2) y(y + V) /Ge -- y) ... (2 + y 4-4) y 3 3:0 
1525120 Sna(z + 1) (2+2) у: у)... (x+y 4-3) 4 10 13-3 
1>3—0 ma + 1) (2 +2)(2 +y)... (2 + у+9) h 67 67-1 
Total n n 100 99-9 
| 


We now turn to our previous assumption that p varies betwee, 
A-distribution given by (3). Applying this modification to the e 
wood model in a manner similar to that employed for househol 
the expectations and observations shown in Table 4, which 18 
maximum likelihood scores are 


B, ee тасы 


n households and has the 
xpectations of the Green- 
ds of size three, we obtain 
analogous to Table 2. The 


x ®+1 2-2 


-(2 NE - n—a—h b 
Фу rctydcl ®+у-+2 уз + 
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Some additional accuracy can no doubt be gained if the observed numbers in the several 
numerators in (15) are replaced by their expectations. There seems to be no obvious way 
of finding simple initial estimates, but with a few trial values it is not difficult to get suffi- 
ciently accurate approximations. Scoring with ж = 1-43 and у = 0-31 gives 


S, = +0-4124 and S, = —0-2782. 
The information and covariance matrices are 


a-[ 14-6024 zz md Tb 10-2x[ 21-925 Ec 


—55.233 303-810 43-9861 10538 |" (19) 


The required adjustments to the estimates are therefore + 0-08 and + 0:01. We may there- 
fore write the maximum-likelihood estimates as 
&, = 1-51 + 0-47 and 7, = 0-32 + 0-10. (17) 


Again, the expected values appearing in Table 4 show a very close fit to the observations. 
Grouping the first six classes into three pairs leaves 4 degrees of freedom altogether. The 
resulting goodness-of-fit y? with 2 degrees of freedom is 2-20, a value which is entirely 
satisfactory. It hardly seems worth while at present to undertake a similarly modified 
treatment of the Lidwell chain model. 

Comparing the results given in (11) with those in (17) shows no significant difference 
between the estimates obtained for households of three and households of four. We may 
therefore combine these estimates. The weighted averages, with weights equal to the 


reciprocals of the variances, are 


4 = 1-18+0-22 and 9 = 0-279 + 0-052. (18) 


The average chance of infection, p, is given by putting r = 1 in (5). Thus we obtain 
p= “је +у) = 0:81, 

Tf in the future it proves possible to group data on the 
{ a disease according to some such classifications as 
it will be interesting to see whether p has a more 


(19) 


using the values appearing in ( 18). 
household distribution of the cases о 


al area or social class, ебе., 


geographic 
on within each of these groups. 


circumscribed distributi 


4, SUMMARY 
Greenwood (1931) was the first to introduce the chain-binomial model to describe the 
a disease with a short period of high infectivity and 


household distribution of cases of } : 
à constant incubation period. Greenwood obtained satisfactory goodness-of-fits to the 


St Pancras measles data, at least so far as the distribution of the total number of cases was 
concerned. Wilson et al. (1939) showed that on their measles data for Providence, Rhode 
Island, even if the fit to the distribution of the total number of cases was satisfactory, the 
theory was quite inadequate when the distributions of the separate generations of the disease 
Within a household were examined. Later, Greenwood (1949) suggested that this might be 
due to variation in the chance of infection, p, between households. In the present paper the 


Providence: data arë re-examined on the assumption that p varies between households 
edu on. Two parameters then have to be estimated. p appears 


according to a suitable £ -distributi А 
9 
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to have a J-shaped distribution, with an infinite ordinate at р = 1. Two groups of the 


Providence data for households of three and four individuals were investigated, and in both 
cases excellent fits of the theory to the data were obtained. 
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SPREAD OF DISEASES IN A RECTANGULAR PLANTATION 
WITH VACANCIES 


Ву С. Н. FREEMAN 
East Malling Research Station 


1. INTRODUCTION 


This paper discusses the spread of diseases from plant to plant ina rectangular plantation 
where there may be some missing plants, which will be described as ‘vacancies’. The 
problem is considered in two ways: (i) the degree of agglomeration of diseased plants on 
any one occasion, (ii) the spread of disease over a period of time. In the work which gave 
rise to this investigation the plants were hops, and the spread of the virus disease, nettlehead, 


over a period of five years was examined. 
The methods used for the first aspect of the investigation, which may conveniently be 


called spatial, derive from various writers, principally Krishna Iyer. Historically, the first 
attempt to deal with this problem was that of Todd (1940), although previously Cochran 
(1936) had examined the linear case. Todd considered the distribution of pairs of adjacent 

tion that the underlying distribution was the 


plants, or points on а lattice, on the assump 
binomial. The method of doublets, as he called it, has been followed in most of the subsequent 


work, although Finney (1947) showed empirically that Todd’s formula seriously over- 
estimated the variance, because in an agglomeration doublets do not occur independently 
of one another. Van der Plank (1946) applied Cochran’s and Todd’s methods to virological 
studies, but without knowing of Finney's criticisms. Furthermore, he only considered 
doublets in one direction. 


Moran (1948) considered the distribution of doublets for a general lattice, not specifically 


rectangular, and also found third and fourth moments in а particular case of the rectangular 
lattice. Krishna Lyer gave general solutions to the problem for a rectangular lattice in two 
papers; diagonal pairs were excluded in the 1950 paper, but considered in that of 1949. 
Moran and Krishna Iyer both distinguished two cases: (i) ded the probability of any 
plant being diseased is the same as that of any other plant being diseased and independent 
of the number of diseased plants, and (ii) where the number of diseased plants 1s known and 
interest attaches only to their distribution over the area. These cases are called free and 
non-free sampling respectively. Following the accepted notation, diseased plants are 
referred to here as black and healthy plants as white, and it is the distributions of both black- 
black (BB) joins and black-white (BW) joins that are of interest in practice. ] 
Todd also considered sets of three and four neighbouring plants, called respectively 
triplets and quadruplets; but for these Finney (1947) showed that the variance would и 
under-estimated on the binomial assumption. Sukhatme (1949) considered triplets pe 
stated that the inclusion of diagonals gave а more efficient test than the eo о 
diagonals, i.e resulted in smaller coefficients of variation. ‚Не showed, further, i У ri 
Sinacnsle were included the coefficients of variation for triplets were larger po Ug ог 
doublets. In consequence of these results no further attention is here paid to triplets. 
tioned writers has considered what happens when there are 
al formula can be adapte! 


None of the above-me? A ee : d 
i б ћ Moran's gener o this case. In 
о ae has been to generalize Krishna Lyer's methods 


the present paper the approach to the problem E- 
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f tangular lattice to make them applicable where there are vacancies rather than to 
т i in е : Moran's methods. The results must of course be the same, but Krishna Tyer в 
= the more elegant, for calculating both the probabilities of the number of joins 
Ps cen type and the total number of joins. (For this latter, the method of his 1949 paper 
has been preferred.) Both the cases, including diagonals and excluding diagonals, are = 
sidered, these being referred to, by analogy with chess, as queen and rook cases respective у. 
In the particular example given later in the paper the queen case was thought appropriate, 
but formulae are given here for the rook case as well, for completeness. 
The methods used for the second aspect of the investigation, which may be called temporal, 


derive from the work of Barnard (1947 ) and Pearson (1947). Barnard distinguished three 


possible types of significance test for a 2 x 2 table—of which the second or 2 x 2 comparative 
trial is appropriate here—and Pearson obtained formulae for actually testing the signi- 


ficance of observed results. All that is done here on this aspect is to consider the relevant 
method of analysis and apply it. 


2. SPATIAL ASPECT 
Consider a rectan 


that there are у 
whereu+v+w = 


gular lattice m by n with vacancies, where m+n = 
acancies in the interior of the lattice 


In the non-free sampling case, then, the probabili 


T". 
ty of any point being black is EE 
and of any point being white is i = - Thus the probability of any join of two points being 
BB is 
т (п. — 1) or 2100-09)! (1) 
(b—U)(b-U —1) (m, -2) b — Ur 
Similarly the probability of a BW join is 
2nin, 

(BLTRIE — LI 

(5-0)%-1-1) (2) 


The corresponding probabi 
Further, two joins can 
points, in which case they 
four points in two discre 
non-free sampling the pr 


lities for free sampling are p? 
arise in only two ways. Either they can come from three adjacent 
will be described ав dependent pairs of joins, or they can come from 
te pairs, when they will be called independent pairs of joins. For 
obability of obtaining three adjacent black points is 

т! (b— U — 3) 


(n,-3) b 07у» (3) 
and of obtaining four black Points in two di 


and 25g, 


Screte pairs is 
т!(6б— U — 4)1 

"uu тш men ME 
(n4 — 4)! (5 — ТЫ 


ж = оң 


Ар“ 
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К For BW joins the position is slightly different. If three adjacent points are to make two 

oe BW pairs the points must be either BW B ог W BW. The sum of the probabilities 
of the different kinds of dependent BW pairs is thus : 


ту (ту — 1) Ng + ny (n5 1) Ny No(Ny + По — 2) 
(5) 


(b—U) (b—U—1) (b-U-2) (5-0)(%-0-1)%-0-2) 


The probability of two independent В W pairs is 


Ату! то! (6— 0 — 4)! 
(m —3) (па 2) (6— 0)! (6) 


for free sampling are respectively 23, p!, 


The probabilities corresponding to (3)-(6) 


Palp +q), 4p'q*. 

Now Krishna Iyer (1950) has shown that the rth factorial moment about zero for the 
ins is r! times the sum of the expectations of the different 
in the lattice. The argument employed in that paper 


to show that the above statement holds for both rook 


distributions of both BB and BW jo 
ways of obtaining r joins of that type 
(pp. 199-201) can be readily modified 


and queen cases, even if there are vacancies. 
All that is necessary, therefore, is to calculate the numbers of joins or sets of joins of a 


given type for the particular case under examination, and this is done below. 
(а) Queen case. On the lattice there are, in all, 4 corner points, 2(a— 4) border points 
and b— за +4 interior points. Thus the total number of joins of existent points on the 


lattice is 4, where А = 3.4+5.2(а—4) + 8(0— 2214) - 2V, 


й being a correction for vacancies, 1.6. 
A = 4b —3a-2— V. (7) 


to V, each border vacancy 5 and each corner 
if p is the number 


(8) 


Now each interior vacancy contributes 8 
vacancy 3, except that two neighbouring vacancies reduce V by 1. Thus, 
Of pairs of vacancies V = 8u+5v+30—p. 
If wis the expected number of joins of a given type we have, for non-free sampling from 
the appropriate expression (1) oF (2): for BB joins 
An,!(o—U —2)! 
fe Т ігі (9) 


A m —2) (b — 0) 


and > је 
for BW joins om А 

“-0-0)%-0-1) (10) 
where А is derived from (7 ) and (8). For free sampling р? and 2р4 are substituted for (1) 
and (2) respectively: | 
se a ndent and independent pairs 


The f дере: 
mame ot ОТР adjacent points, two joins can arise in B 


the first сазе, where there are three 
B= 3.4410. 2(a—4) +2800- 24+ 4) — ГА 


of joins must now be considered. In 
ways, where 


r vacancies, 1.6. 
(11) 


W being a correction fo 
В = 4(1b —9a-4- 11) - W- 
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Now each interior vacancy contributes 28 to W directly, each border vacancy 10 ыы - 
corner vacancy 3. Also each interior existent point with one nephpourig —€— r 
tributes a further 7 to W, one with two neighbouring vacancies 7 +6 ( = 13), one wi : ни 
7--6--5, etc. Similarly, border existent points with one neighbouring vacancy contri = 
4 to W, those with two 4+3, etc. Finally, corner existent points with one oh ша g 
vacancy contribute 2 to W, ete. Let there be 2; interior existent points with i neighbouring 


қ 3 кыр. ar m 5 - НИ 
vacancies, y; border existent points with i neighbouring vacancies and 2; corner existe 
points with 7 neighbouring vacancies. Then 


W = 28и + 10v + 30+ (Та + 1325 + 1825 + 22x, + 252 + 27, + 28x, + 28:4) 
+ (4 + Туз + 995 + LOY, + 1095) + (221 + 32, + 324). (19) 


. А(4—1 . T 
Since the total number of pairs of joins is ( ~ 2 , and pairs can only be dependent or 


;. A(A-1 
independent, the number of independent pairs is sesh - В. 


For ВВ joins we thus have, from 


(3) and (4), that the second factorial moment is (бәр 
Miere „= 28% 16-0 -—3)! [A(4-1)-2B]m!(b-U—4)! 
И вур]! + (n,—4)!(b— 17)! 


Now if 02 is the variance, 0? = hgt H= per. (13) 


, _ 286-03)! An, (5 — U — 2)! 
Черт (m,=3)!(6—U)! (а, —2) (6 у! 


4I -1)-2B]n,!(b— 0-4) ГА! = a4) 


(— 4)! (6— U)! (па —2)!(6— U)! 


Similarly, we have for BW joins, by substituting in (13) from (5), (6) 


and (10), 
сз = ___ У Вттојт +n. — 2) 4 2AN no 
(6—U) (b—U-1) (6=U—2) (b—U)(6—U=1) 


+ЧА(4 —1) 2B) n,! ng! (5— U — 4 [ 


2A 2 
(m= 2)! m- 2) (5 —U i vine | (15) 


b—U)(b—U —1) 


er of points on the lattice 
points on the lattice is A’ 


24' = 2.43.2(0—4) 4.4 
V" being a correction for vacancies, i.e, 


са | being the same аз before, the total 
number of joins of existent 


‚ Where 
b —2a 4-4) - 2y", 
(16) 


by 1. (Here, and for the re- 
> two points are only regarded as neighbours 
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if they are next to one another on the rank or file.) Thus, if p’ is the number of pairs of 
vacancies, p' being in general different from p as defined above, we have 
V' = 4u 4- 3v 4-2w — p'. (17) 


The expressions (9) and (10) in the queen case are now replaced by the following values 


for the mean number of joins: for BB joins 
пи.’ ЕЗ 
24'nn. 
and У joi -6-U)6-0-1 
nd for BW joins #= =U) (b-U-1)’ r 
A’ being derived from (16) and (17). For free sampling p? and 2pg are again substituted for 


(1) and (2) respectively. 
For the calculation of the second moment we again need to consider the number of ways 


of getting the two different types of pairs of joins. Now, three adjacent points can here be 


derived in B’ ways, where 
B'21.443.2(4—4)3 60 —2a4-4) - W^, 


W' being à correction for vacancies, i.e. 

В' = 6b —6a--4— W: 
6 to W' directly, each border vacancy 3 and each 
tent point with one neighbouring vacancy соп- 
uring vacancies 3 + 2, etc. Similarly, border 
ontribute 2, and so on, and corner existent 


(20) 


Now each interior vacancy contributes 
corner vacancy 2. Also each interior exis 
tributes a further 3 to W’, one with two neighbo 


existent points with one neighbouring vacancy 6 
points with one or two neighbouring vacancies 1. Let there be x; interior existent points 


with i neighbouring vacancies, yi border existent points with i neighbouring vacancies 
and z; corner existent points with i neighbouring vacancies, where 2; is not in general the 


same as а, etc. Thus 


W' = 6u+3v+ 2:00 (24 + 395 + 395) + (22 +). (21) 


(дал + Ба + 625 + 624) + 


4'(А'—1) ‚ the number of independent pairs is, 


Since the total number of pairs of joins is 
A(A'-1)_ p 
by the same reasoning as for the queen case, 2 3 


Hence for BB joins we hav 
sion as (14) with A’ written 


BW joins is given by makin 
required for the free sampling case are mer 


e, by substituting in (13) from (3), (4) and (9), the same expres- 
for A and B’ for B. Similarly, the variance of the number of 
g the necessary modifications in (15). As before, the changes 
ely in the probability expressions. 


3. APPLICATIONS 

put its application can best be described 
of which see the paper of Legg (1958). 
f hop nettlehead disease was considered over a period of five years, 
оша be regarded as being due to a natural transport of inoculum 
to do this it is essential to examine both spatial and temporal 


completely general, 


d aboveis © 
example, for further details 


The method as outline 
by means of à specific 
The apparent spread 0 
to determine whether it ¢ 
from plant to plant. In or der 
aspects of the spread. 
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The spatial approach is necessary for the consideration of whether diseased plants occur 
in agglomerations, or whether they can be regarded as occurring randomly in space. The 
hops in this instance were planted at a distance of 6 ft. square, and the method of training 
them was such that each plant touched its neighbours, along the rows, across the rows and 
diagonally. This meant that the queen case suggested itself as being the one to use. The 
result of the analysis was that there was strong evidence, in three out of the five years at 
least, that diseased plants were more often found in agglomerations than would have been 
expected had these plants been randomly distributed. 

As an example of the use of the method Legg's data for the year 1951 will be considered. 
Fig. 1 shows the distribution of plants showing nettlehead symptoms (black plants), those 
without symptoms (white plants), and missing plants (vacancies). The lattice is 11 x 30 
plants in area, and so a = 41, b = 330. Further, there are 122 plants with symptoms and 

196 without symptoms, and so these are the respective values of ny and no. 

The distribution of vacancies is such that u = 11, v = 1, w = 0, and there are no pairs of 
neighbouring vacancies, i.e. p = 0. We thus have 


U-1141-212 and У = (8х11)+(5х1) = 93. 
This gives 5 — U = 318, and, from equation (7), 
A = (4 х 330) — (3 x 41)--2— 93 = 1106. 


We now have to consider the existent points with various numbers of neighbouring 
vacancies, and we find that z, = 66, 2, = 8, 23 = 1, у, = 8, while the remaining z's and у» 
and all the z’s, are zero. Thus, from equation (12), 


W = (28x 11)+(10 x 1) +[(7 x 66) + (13 x 8) + (18 x 1)] + (4х 8) = 934, 
and so, from equation (11) 


В = 4{(7 x 330) — (9 х 41) + 11]— 934 = 6874. 


Before the next stage of the computation it is convenient to find А(А-1)-2В, which 
equals (1106 x 1105) — 6874 = 1208382. For BB joins we then have 


_ 1106 х 122 х 121 _ 


кх ші же ecco NE ВОН 
318 x 317 "-— 
and — o? pp 2X 0874x122 х 121 x 120 | 1208382 x 122 x 121 x 120x 119 |, 
318x 317 x 316 318x 317 x 316 x 315 ah 


which reduces to 530-66, whence с = 23-0. 

Similarly, for BW joins, we find д = 524-7, с = 15:8. 

'The observed numbers of BB and BW joins are respectively 220 and 408, the former of 
which exceeds expectation by about two and a half times the standard error, and the latter 
of which falls short of expectation by more than seven times the standard error. By thu$ 
using the standard error as a measure of the discrepancy between theory and observation 
it appears from the combination of these two results that it is very unlikely that the observed 
distribution of diseased plants was in fact due to a random distribution of these plants in 
the field. 

In the case of nettlehead, which is a virus disease probably transmitted by insects, and 
with this particular method of growing hops, the queen case of the spatial aspect must be 
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the one to use. This need not, however, always be the case, and there may be times when it 
is desirable to consider the spread of disease where diagonal joins are excluded. Furthermore, 
in the case of a disease believed to be spread by cultivations, these being carried out in one 
direction only, it may be appropriate to consider a linear lattice. (Since the formula for this 
case can be easily derived from that for a rectangular lattice by taking one of the dimensions 


Fig. 1. Nettlehead status of plants in 1951 
39 40 41 42 43 44 45 46 47 48 49 
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N = Nettlehead plant. V = Vacancy. + = Plant without symptoms. 


to be of unit width, it is not given explicitly here.) There may, moreover, be cases where 
Practically nothing is known about the disease under investigation, and then it may be useful 
to consider a sort of sequential approach, first using a linear lattice, and then, successively 
the rook and queen cases for the rectangular lattice. If this is done, and one method shows 
€vidence of spread but the next does not, the results may well shed some light on the method 
of spread of the disease in question, and thus provide clues to the agency of spread. 
Investigation of the spatial aspect of the matter is not in general enough by itself, and the 
temporal approach should be used in conjunction with it. The reason for this is that even 
definite evidence from the spatial approach cannot do more than suggest that diseased 
Plants occur in clusters. This is not sufficient to prove that there is infectious spread, for there 
May be ecological or other factors which could give rise to the same pattern of diseased 
Plants. For example, there could be localized conditions of вой or shelter giving rise 
to patches where the disease develops or where Symptoms are expressed. Equally a 
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physiological disorder can occur in patches. If, however, the temporal method shows that 
the disease is spreading throughout the plantation under investigation, while retaining the 
characteristic pattern of agglomerations, there would be strong evidence of genuinely 
infectious spread. 

The investigation of nettlehead disease provides one practical instance of the use of ше 
two approaches, and a second example is given by another disease of hops, the fluctuating 
type of Verticillium wilt. In one particular locality the appearance in one year of agglomera- 
tions of diseased plants suggested a spread of the disease, but these agglomerations were 
found to be due to the influence of flood water in the winter in relation to the contour pattern 
in this hop garden. Here the use of the temporal approach would have immediately shown 
the spread to be apparent rather than real. 

The temporal method is used to examine whether plants next to diseased plants are more 
liable to go down with a disease than those all of whose neighbours are healthy. To revert 
to the nettlehead example, the investigation concerned itself with the virus status in any 
year of plants healthy in the preceding year, in order to see if the disease spread principally 

to those plants whose neighbours were already diseased. Any plant which was missing the 
second year but not diseased the first year was tacitly excluded. 

To investigate this, the method of Barnard’s 2 x 2 comparative trial was used. For, as 
Barnard states, the fundamental subject-matter of a 2 x 2 comparative trial is two popula- 
tions, and here we have the two populations of healthy plants: (i) those having some 
diseased neighbours, and (ii) those having all healthy neighbours. These populations being 
given, records are then taken in the next year to see whether the proportion of diseased 
plants is the same in the two populations. It will be seen that, whereas there may be some 
doubt as to which type of spatial approach to use, the essential supplement using the 
temporal approach must make use of the 2 x 2 comparative trial. It is obvious that the same 
definition of ‘neighbours’ will obtain for the two aspects of the same investigation. | 

Fig. 2 shows the distribution of plants showing nettlehead in 1951 and not in 1950 1n 
relation to plants that showed nettlehead in 1950. The division of the population described 
above shows the following numbers of plants in each category: 


Plants in 1951 healthy in 1950 
Neighbours in 1950 
Diseased Healthy Total 
Some diseased 53 88 141 
All healthy 12 90 102 
Total 65 178 243 


In Barnard's notation, и being a variate distributed normally with unit variance, we 
then have 


, Gu) quxsp, Пат toe x 65x 178 
243 i 243 x 243 x 242 "' 


whence и = 4-48 with probability P < 0-001. 
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It thus appears that diseased plants are very much more likely to occur if some of their 
neighbours were previously diseased than if all their neighbours were previously healthy, 
which suggests that there is some form of infectious spread taking place. 

In the nettlehead example, the results of the investigation of the temporal aspect agreed 
very closely with those of the spatial aspect because those years when the spatial approach 


Fig. 2. Nettlehead status of plants in 1951 in relation to that of 1950 
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O = Nettlehead plant in 1950, irrespective of whether it had nettlehead, had be 
a vacancy in 1951. 
N = Nettlehead plant in 1951 but not 1950. 
V = Vacancy in 1951, not nettlehead in 1950. 
. = Plant without symptoms in both 1951 and 1950. 


en replanted or was 


Suggested agglomeration were those in which there was a tendence 
plants to be next to those diseased the previous year. Also, in thi 
Spread so far across the recorded area by the end of the perio 
Suggest that the liability of plants to infection was not related 
there was less likelihood of ecological factors accounting for the s 
when there were significantly more BB joins than expectation th 
BW joins, and when one did not differ significantly from expect; 

In most practical cases plants will be either diseased or hea: 
white colours are likely to account for all the existent points 


y for more newly diseased 
s instance, the disease had 
d under investigation as to 
to their position, and thus 
pread. Further, in general, 
ere were significantly fewer 
ation neither did the other, 
Ithy, and so the black and 
on the lattice. There could, 
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.however, be cases, such as those of a complex of viruses, where more colours were е” 
Also Shas could be cases where some of the plants were carriers of the disease withou 
знане symptoms themselves. It would then probably be desirable to treat such plants 


as vacancies from the spatial point of view but as diseased neighbours the previous year in 
the temporal approach. 


4. SuMMARY 


This paper considers the spread of disease in a rectangular plantation where there may И 
some missing plants, or vacancies. The problem is considered from two points of view, т. 
апа temporal. In the former, formulae аге given which modify the existing methods “oa 
by Krishna Iyer to allow for vacancies. In the latter, the 2 x 2 comparative trial of Barnard 
is used. Both methods are applied to a particular example of the apparent spread of a hop 
virus disease, nettlehead. In this case the results achieved by the two methods supplement 


each other, and, in general, both must be used before it can be concluded if the apparent 
spread corresponds to a spread of inoculum. 


Таш much indebted to Dr 8. С. Pearce for helpful discussion on the statistical aspects, to 
Mr J. Т. Legg for his ready co-operation on the plant pathology side of the study and to 
Dr R. V. Harris for bringing to my notice the example relating to Verticillium wilt. 
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TESTS OF SIGNIFICANCE FOR CONCURRENT REGRESSION LINES 


By E. J: WILLIAMS 


Commonwealth Scientific and Industrial Research Organization,’ Melbourne 


The fitting of a straight regression line to а set of observations on two variates is a straight- 
forward application of the method of least squares. Where there are several sets of such data, 
it is of interest to fit regression lines for each set, and to determine whether or not the slopes 
of the lines differ more than would be expected due to chance variations. Where the slopes 
of a set of lines do not differ significantly, the interpretation of the data is particularly 
straightforward, since the comparison of the sets may be made simply by comparing the 
constant terms of the regression equations. In fact, if these terms also do not differ signi- 
ficantly when a single slope is used for all lines, then the regression may be regarded as 
completely homogeneous, and a single equation used to represent the relationship for all 
the data. 

The case of parallel regression lines is particularly important in many applications; for 
example, the comparison of two materials by means of biological assay can be carried out 
Satisfactorily only when the regression of response (suitably measured) on dosage is linear, 
and has the same slope for each material. The relative poteney is then measured as the 
distance between the parallel lines, measured in a direction parallel to the dosage axis. 
When the regression lines are not parallel the interpretation is difficult or impossible. 

Another case which appears to be important but has not received so much attention is 
the case in which the regression lines, instead of being parallel, are concurrent. This type of 
effect is likely to occur when different materials produce different rates of response, while 
all give the same response at some fixed level. This level in practice is often not zero, and 
often not exactly known. When three or more regression lines are concurrent, it is possible 
to measure the difference between the effects of different materials or treatments by means 
of the ratios of the distances between the lines measured along an ordinate. It will be seen 
that parallel regression lines are simply a special case of concurrent lines; in fact, the ratio 
of the effectiveness of different materials may be measured in precisely the same way as for 
concurrent lines, that is, by the distances between the intercepts on a transversal. 

Tocher (1952) has recently described situations in which it is of interest to know whether 
the regression lines for different sets of experimental results are concurrent. He has given 
a test of significance of the hypothesis of concurrence, and a method of determining fiducial 
limits for the abscissa (i.e. the value of the independent variable) of the point of concurrence. 
The purpose of this note is to give what is considered to be a more satisfactory solution of 
these two problems, as well as to discuss further applications of the fitting of concurrent 
lines, 

When the point of concurrence of the lines is known in advance, or only the ordinate 
requires estimation, the problem is, as pointed out by Tocher, amenable to standard methods. 
When the abscissa requires estimation, the sums of squares required for the significance 
tests given here are found to be latent roots of a certain matrix, The analysis and tests of 
Significance are found to parallel those given (Williams, 19520) for the interpretation of 
Interactions in factorial experiments. 
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FoRMULATION OF THE PROBLEM 


К Я jt 
Our notation differs from that of Tocher, which is not always convenient for the qoum 
сло. We consider m sets of values of x and y; the pairs of values, n; in number, in 

ith set are denoted by v; and y;. We confine ourselves to the special case 


Ух; = En, } 
Уа = Хот, 
(2 and X? being constants; X +0 since the т, cannot all vanish), which includes the com- 


monly occurring case of all sets of data having the same values of x. Without loss of gener- 
ality, in the mathematical discussion we take 


(1) 


2-0, (2) 
x LY: =0. (8) 

We also put Xy; nj (4) 
yt; = nili (5) 

Ут = М, (6) 

У тд = №, (7) 


and take the point of concurrence аз (5.1). 


(i) Ordinate of concurrence given 
We assume that 77 is known, and 


only £ is to be estimated. The regression coefficient for 
the ith line through (Е, у) is 
p, = Хт) (%—&) 
i (а, $) 
= % $@ —7) (8) 
Ec 


Thesum ofsquares for regressions through (Ё, у) 


, With m degrees of freedom, is accordingly 
S= уљане) 


> паф— 2E > ^uqí(y;— 1) + 22 EnQ(yu- 9) 
Ж = | | 


sum of squares about 7 is independent of 2, 
S as minimize the residual sum of squares, P 


Since the total 


to estimate £ we may equally 
ut 


= В, En- = С. 
Then (9) gives 


(X? 2) 5 = AX*-2BX£4 ор, (9') 
and 8 is maximized if 29 = -ВХ +ОЕ (10) 
ог XS = AX — ВЕ. (107) 
From (10) and (10^) 


we see that the maximized g must be the larger latent root of the matrix 


a ral m 


a 
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The two latent roots, S; and S, may be given a practical interpretation. S, is the sum of 
Squares for regressions and the point of concurrence, and thus has m + 1 degrees of freedom. 
Sz, with m — 1 degrees of freedom, is the sum of squares representing departures from con- 
currence. Corresponding to the latent root S,, we have, from (10), 


У niq: —7) 


as n 
? S= Xini(y;—9) 


--5-0' аз) 
From the form of the matrix we see that 5, is zero if and only if 
4:0 9:—1, (13) 


and from (12), that in such cases the coefficient of proportionality is — Х2/2, Then b; = а] Х?, 
from (8), and the unrestricted regression lines for each set all pass through the point of 
concurrence. In particular, there is no departure from concurrence when either all the 4; 
are zero, so that the lines are parallel, or all the 7; = 7, so that the point of means is the point 
of concurrence. 


(ii) Ordinate of concurrence to be estimated 


When 7 is not given, it may be so chosen as to minimize the residual sum of squares. 

We then find 
q = g| X*. (14) 

But 9/Х? is the average unrestricted regression coefficient for all the data: therefore, as 
might be expected, the point of concurrence lies on the line of mean regression when 7 is 
estimated from the data. It is therefore appropriate to consider departures from mean 
regression only in testing the significance of departures from concurrence. 

The sum of squares for departures of individual regressions from mean regression is 


8' = (и: - Nb?) (X2+ £2) 


Xnj(q;— 9)? — 28 > ту; — 9) +E Xi wi 
P X24 22 


A'X?—2B'X£ 4 C'£? 
- X24 22 (say), 5 (15) 


with ж— 1 degrees of freedom. As with S, the maximum value of S" is found as the larger 
latent root, with m degrees of freedom, of 


A В" 

Er logit (16) 
Again, the smaller latent root, with m— 2 degrees of freedom, provides a test of departures 
from concurrence. The effect of having to estimate 7 is merely to replace the individual 


Means and regressions by departures from overall means and regressions, and to reduce 
by one the degrees of freedom. 
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TESTS OF SIGNIFICANCE 


In testing the significance of the differences among regression lines for several sets of data, 
the usual approach is to examine, first, whether the slopes of the lines differ significantly, 
and secondly, whether, if the slopes are assumed not to differ, the distances between the 
parallel lines fitted to the data differ significantly. For the present purpose, the two aspects 
are viewed in a different way. It is seen from the above discussion that the sum of the two 
latent roots involves both the differences among regressions and the differences among 
means, but that the larger latent root, representing differences among regression lines, may 
involve one or other aspect alone. 

The testing of significance of the latent roots of a matrix has been discussed elsewhere 
(see, for example, Williams, 1952 а, b, and references). The kind of test which seems appro- 
priate to the present problem is one which, first, tests the validity of the assumed model, 
ie. whether or not the departure from concurrence is significant, and secondly, gives 
fiducial limits for £. We first consider case (i) above, with 7 known. Corresponding to 2 


given value of £ we may determine S, the sum of squares for regressions through (2,7) 
then calculate the quantities 


(S, — S) (S— 5%) 


aaa == Pn 
8,8 
ала ъ= g. (18) 


These are distributed as sums of squares with 1 and m— 1 degrees of freedom respectively: 
v, gives an exact test of departures from concurrence, corresponding to £, and v, a test of 
concordance of the point of concurrence (2, 7) with the data. Provided v, is not significant, 
v, may be used to give a fiducial limit for S at any given probability level, from (17). 

It should be emphasized that, although it is often still possible formally to determine 
fiducial limits for even when 2, is significant, such fiducial limits for the abscissa of соп- 
currence are irrelevant, since the hypothesis of concurrence is no longer tenable. It is 
therefore important to test v, first before considering the estimation of Ё, 


The corresponding fiducial limits for £ are derived from equations (9) and (17), and are 
most conveniently given as follows: | 


ES- > nij; —3)] + 2£ У па —3) + X28 — Ут = 0, 
> паду — 17) + X /($ъ,) 
7 S= È (g=) 


— ХОВ + у/(®ь,)) 
> io БА 


Еш 


Alternatively, since v, is а sum of squares with 1 degree of freedom, its numerator may be 
expressed as the square of a rational function of £. Since, from (12), v, vanishes when 


__ ХБ 
AE I 
XB 
or {= 
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i S,—C)£—- XBP[(C—S,)) E- ХВ] 
it a К 1 $ о 
ppears that Su, В+ 
[РВ + ЕХ(С— A) - Х2В]? 20 
(X? 22)2 * (20) 
The non-linear analysis of variance may be set out as follows: 
аа, Sums of squares 
Point of concurrence 1 Виза ra) 
Regressions through point of concurrence m © 
Departures from concurrence т—1 Am 
Total between lines 2m S+S: 
Residual N—2m by difference 
Total N УУ (и.-т) 


When m = 1, 5, = 0, so that the test for a given point of concurrence reduces to a test of 
Significance of the difference S, — S. 

For case (ii), when 7 is to be estimated, a similar approach may be used. Here, however, 
We require to determine simultaneous fiducial limits for £ and 7. The limits will be given as 
а closed curve surrounding the optimum point. 

If the variation of 7 about the value given by (14) is not considered, we may ealeulate the 


Statistics (81— 8") (S' — 82) 
р (21) 
S 
, _ 515% 
апа ty e» (22) 


With 1 and m — 2 degrees of freedom respectively. As for the case when 7 is known, these two 
Statistics may be used to test departures from the abscissa of concurrence 6, and departures 
from coneurrence, respectively. 

Now for the departure of a given value of 7 from its optimum value Zg/ X?, the sum of 
Squares, with 1 degree of freedom, is 

N(yX* — $q)? 
X2(X24 2) ° (23) 

For a simultaneous test of significance of the concordance of 6 and у with the data we 

тау therefore take N(yX®—£q)? 


t Хх) 


S18; М(Х — £q)? 


= 8-87 Si Еј 
= й+&-#- з 
= Vy + V_— V2 (24) 
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This is a sum of squares with 2 degrees of freedom. The set of values of £ and 7 giving а: 
constant value to (24) are the required fiducial limits for the position of the point of 


concurrence. 
COMPARISON WITH TOCHER'S RESULTS 


The present paper gives significance tests which are exact. and more detailed than those 
given by Tocher. In fairness it should be remarked, however, that we have treated only the 
special case defined by (1), and that there are unlikely to be exact tests for the general case 
treated by Tocher. As there are similarities between the two approaches, it is of interest to 
compare them. For convenience, we rewrite some of Tocher’s results in our notation; his 
equation numbers will be distinguished by the suffix T. z 
Tocher first tests the significance of the restraint that the lines be concurrent at a point 
with abscissa £. The criterion is a sum of squares with m degrees of freedom, which is 
identical with our S,--8,—8 = +, (25) 


Its minimum value, which Tocher takes as critical for the test (14T), is clearly Sz, which is 
reached when S = S,, and & has its optimum value. S,, however, is not distributed as a sum 
of Squares with m degrees of freedom, as Tocher assumes; nor would a reduction of degrees 
of freedom to m—1 be correct, since the equation for optimum £ is not linear. Tocher’s test 
is, in fact, conservative, as he intends; that is, the significance of departures from con- 
currence tends to be underestimated (see Williams, 19524, p. 23). 

The optimum value of £ is that for which S = S,. For fiducial limits, Tocher uses the 
values of £ corresponding to a value of 5 making v; +v, significant at a given probability 
level (15T). The values of 2 are in fact given by (19), with the appropriate value of S. We 
propose that the setting of limits be done in two stages. First, corresponding to any 5, va 
representing departures from concurrence, is tested for significance. If v, is not significant, 
fiducial limits for £ are determined from the value of v, significant at a given probability 
level. If v, is significant, the hypothesis of concurrence must be rejected, and no meaning 
can be attached to a point of concurrence. 

Either method of setting fiducial limits is theoretically correct. However, it is con- 
sidered that there is a conceptual advantage in distinguishing between the test of the 
hypothesis of concurrence and the test of a given point of concurrence. When % is small, 


the use of v, also gives narrower fiducial limits than V4 +02. This point is illustrated in the 
numerical example given below. 


NUMERICAL EXAMPLES 
(1) Re-examination of Tocher’s data 
In order to compare the results given above with Tocher’s, we now carry out a comparative 
analysis of his results. It is noted that he takes у = 0. From his data (p. 115) we find 
A = 0-1013 5806 6, 
В = 0-9224 9221 3, 
С = 8-3960 7140 0, 
S, + 8; = 8-4974 2946 6, 
4,8, = 0-0000 1767 7, 
©, = 8:4974 2738 6, 
S, = 0-0000 0208 0. 
Residual mean square = 0-9223 1 x 10-6 (6d.f.). 
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Since the 5 and 1 % points of F, with 1 and 6 degrees of freedom, are 5-9874 and 13-745 
respectively, a sum of squares with 1 degree of freedom, to be significant, must exceed 
5:5299 x 10-8 (5 % level), 
ог 12.677 х10-8 (1% level). 
Since, for all values of S considered henceforth, the value of v, is not significant, departures 
from concurrence will be assumed non-existent. 
We now determine fiducial limits for S, and hence for 2, from the equation 


. 0-0000 1767 7 
v, = 8497429466—5— © 5 OIT = 55222 x 10-5 


and = 12-677 x 10-5. 


The fiducial limits are as follows: 


Present method Tocher’s method 
= = 1 - —— 
S | 5 5 5 
қ Е | rere ПАЗАРИ A ас а. " 
8-4974 8:4974 

Optimum value 27386 981-5 27386 981-5 
5% 21864 970 and 993 | 17899 968 and 995 
1% 14709 964 and 999 | 07234 961 and 1002 


The limits by Tocher’s method, given for comparison, are found from (15T), or from 
equation (19) with the appropriate value of S. These ranges are 17 % wider than those from 
the present method, owing to the inclusion of v, in the determination of 8. 


(ii) Data requiring estimated ordinate 


In a study of the effect of basis weight on the properties of laboratory-made sheets of 
Paper, three batches of pulp were taken and beaten for different lengths of time. From each 
batch, sheets of six different basis weights were then made, and mechanical tests carried 
out on them. The burst strength results from this experiment are set out in Table 1, together 
With values calculated from them. 


Table 1. Burst strength results 


Beating (revs of Lampen Mill) 
Basis т Total 
жеш 1125 4500 12730 
$=1 іш? i23 
10 = 1-73 1:98 3-11 
20 23 4-99 9-69 19-83 
30 car 874 17-26 22-69 
40 1 12-60 24-52 32-45 
50 3 17-04 31.36 48-22 
60 5 21-18 43-29 59-94 
бу; 66-28 128-10 179-24 373-62 
ба; 137-26 278-82 400-08 816-16 


20-2 
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Table 2. Test of departures of regression lines from origin 


Degrees Sums of Mean 

of freedom squares square 
Correction for mean 3 8821-604 
Regression through means 3 3666-356 
Sum 6 12487-960 
Regressions through origin 3 12311-932 
Departures from origin 3 176-028 58-676** 
Residual 12 28-554 2.3795 


** Significant at 1% level. 


A preliminary analysis, as given in Table 2, shows that the three regression lines of burst 
strength on basis weight depart significantly from the origin. In order to compare the 
trends for different amounts of beating it would be desirable to base the comparison ол 
regression lines through some other common point, if such a model were concordant with 
the data. The common point would represent the basis weight for which the value of burst 
factor was independent of beating. The ordinate as well as the abscissa of this point will be 
estimated. 

For the independent variable we take 


нн basis weight — 35 
5 > 


so that æ has the values — 5, —3, — 1, 1, 3, 5, and X2 = 35, The quantities required for the 
analysis are given in Table 1. 


The mean burst strength is 20-76, and the mean regression coefficient on x is 


816-16 
210 


— 3-886, 


so that, by (14), the optimum value of the ordinate corresponding to £ is 
7 = 3:886£ + 20-76. 


The matrix (16) is | 494-36040 — 726-07978 
—726-07978 1066-49853 |» 


whence Si = 1560-83437, 
S= 0-03356. 


The optimum value of (5,7) is (— 5-02, 1-26) from (12) and ( 
currence are clearly not significant, so attention may be соп 
determination of the fiducial boundary for (Е, 
Table 2, the values for v, + v 


14). Departures from cor- 
fined to v, 4-v, — v; and the 
à у 7). From the residual mean square given iP 
— v; required for significance may be determined as 

18-490 (5% level), 


and 32-964 (1% level), 
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The equation for the fiducial boundary is, then, 
| (726-122 + 19542 — 8471)? 
(£2 4-38) (106622 — 49602 + 5768) 
, 2100 3-886£ — 20-76) 
P) 


- 218-490 and 32-964. 


Of particular interest are the maximum ranges for ё and 7. The maximum range for £ is 
given above when » does not depart from the value given by the mean regression line. Then, 


f 
тош (19), S = 1542.34, £— —6-4 and —40 (5% level), 


S = 1527-87, &=—70 and — 3:7 (1% level). 


The maximum range for 7 is given when 2 has its optimum value — 5-02; negative values 
being inadmissible, the limits are found to be 


0 and 31 (5% level), 
0 and 3-7 (1% level). 


~. 


| CONCLUSION | 
The main emphasis of this paper has been to show that the concepts developed for tests of 
Significance in multivariate analysis and for interactions may be carried over to dealin g 
With a problem of concurrent regressions. It would clearly be possible to extend this work 
to regression analysis with two or more independent variates. The quantities required for 
testing significance would then, for k variates, come out as latent roots of a (k-- 1) x (k+1 ) 
matrix, Such a generalization, however, appears to lack practical interest. 
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APPROXIMATE CONFIDENCE INTERVALS 
IL MORE THAN ONE UNKNOWN PARAMETER 


Bv M. 5. BARTLETT 


University of Manchester 


1. RETROSPECT 


In a previous paper (Bartlett, 1953; this will be referred to as I) it was shown that the 
approximate confidence interval obtained from the asymptotic normal distribution for 
2100, where L log p is the log likelihood depending on a single unknown parameter 0, 
could usually be improved by the use of skewness or further moment corrections. The 
purpose of the present paper is to examine the further problems that arise in the case of 
more than one unknown parameter. It is perhaps advisable to point out that while the 
direct use of quantities like L and its derivatives permits a fairly flexible sampling basis, 
the asymptotic normal distribution remains the basis of the approximations. We may note 
(see §7) the application of the proposed method to non-classical problems such as time-series 
analysis. It could not, however, be used in problems for which the central limit theore™ 
did not apply. This remark appears relevant if we consider (see $$5 and 6) some of the more 
controversial problems involving nuisance parameters, as R. A. Fisher's fiducial probability 
solutions have been given by some writers (see, for example, Barnard, 1950) an approximate 


frequency interpretation by means of special (and in my opinion rather artificial) sampling 
interpretations. 


2. SIMULTANEOUS MOMENTS 


The systematic method of obtaining relations between the moments of 91.190, 221,003, ебо.› 
is obviously extensible to the case of several unknowns 0, (i = 1,2, ...). The derivation 
of the more general formulae rapidly becomes rather tedious, and I shall only record the 
formulae up to the third-order moments of 91190;. 


> Uy On the usual assumptions about 
differentiation with respect to 0; commuting with the integral sign, we may write (cf. I 


=f 0;--т; = f б ех | ; ab | eL 
> 4) р(0,) exp Zengg ЫР СЫ 5 
oL ЭГ, ЭГ, 2 
whence в(%) =i | = Б — oL 
(6, 7 ^^ 220,20, Боки - 35,57]: 
ЭГ, oL ЭГ, 931, oL 921, 
ШЕКТЕ Т 
20,90,20, 20,20,20,) “100, ә0;ә0,) _ (ag, 3525.] - (% 5.22] 


Differentiating the second-order equation, we see further that 


2 | eL aL OL aL L 
E ІҢ нда, ob 9L ƏL 
06, 00,20, ^ (00,00,0,] – \20,00, 80,| + 50725: = = 


T. 
y controversial fog. I would, howe Ye 
Behrens-Fisher problem, if deliberate" 


negligible, but this is just the limiti i i Е e > 
aa а gugible, 1 miting situation for which we do not require small-sample theo 


ыы. 2—11: == 


—— = 
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From these last two equations, we find that 

OLOLOL| _„„|_ CL д д 2 

00,20; 60, ) CAAA 00; In gg lat 50, 1, (1) 
where Г. = E( —0?L[00,00;). Formulae of the same order involving less than three para- 
meters may be obtained from (1) by contraction (i.e. making two or more 6’s identical). 


3. CONFIDENCE REGIONS 


As already noted in I, the extension from approximate confidence intervals to confidence 
regions is in principle straightforward. Thus for two parameters 0, and 0, our first large- 
sample approximation is to consider the joint quantities 0/00,, 61.190, as normal with zero 
means and known covariance-matrix. From these we may, for example, construct an 
approximate y? expression, writing this (for convenience later) in the unsymmetric form 


ЈА oL АШ м : 
(а) [7.7 aw) 171) (2) 


Values of 0, and 0, for which the expression in (2) remains less than 42, the critical value of 
Х for two degrees of freedom, define an approximate confidence region. An even simpler 
expression asymptotically equivalent to (2) is — 2[L — Lmax ], discussed more fully later.* 
It is, perhaps, hardly necessary to consider any skewness corrections to (2), for in it only 
Squares of the approximately normal quantities are involved. Any skewness corrections 
would merely help to locate the minimum and maximum ‘admissible’ values of 0, and 0, 
more symmetrically (in terms of probability contributions to the total probability of error) 
However, to illustrate the procedure these corrections will be deduced. Consider 


oL Ы 
"0,0-7 + (22) -m 
LOL 1,21 aL\? aLL орла 
T6, 01) =59 1.00, + Aw (% li жан 20, 00, Lie «| (220) -t |. 


In order to replace the random variables in (2) by more nearly normal variables, we require 
the third-order cumulants of 7; and 7, to be zero to the next-order approximation. This gives 


Ко + 6л, = 0 (ef. I), 


(3) 


Жы ы 
(в = г кы) + 223g + 2А. ћу he + А» = 0, 
11 


1, Hn 12 A 4 
(в- 27 Ker tm кы) + 2 А (А Leo — Tin) + ШЕЛ! 99- 7) =0, (4) 
11 11 a 
Lis Iis dis IB 
(e 93r 7, Као) + 6Ago( Љо зе е 0. 


The last three equations may be solved successively from the bottom to the top equation; 
1n the particular case when Zx = 0 they become 


Koy + Wel}, = 0, Ki 21111» = 0, Kog + 6А» 3, = 0. (5) 
* Ttis convenient to defer discussion of this alternative until §8 


likelihood estimates has been necessary, although we shall find th 
from §4 onwards, 


» aS 50 far по explicit use of maximum- 
eir introduction in general unavoidable 
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Thus for a normal sample with 


т т?(2-0)2 l(n—1)s? (6) 
sS ~ E ог "аш 
OL  n(r—0, ƏL т Hm 1) $c n(z—0,)* 
ваша 00," 0. a O 563 | 
OL n OL "(2-60)) eL к. (n — 1) ?--n(z— 0, 
20: — 9,’ 0000, OF ° 7947794 8 
n n 
1, “> ће = 0, I, = op 
и=р› із 2 = 2% m 
n 
Ку = 0, Қат 6% Кү = 0, коз = 0% 
1 20, 
we put А = Ay = 0, Лр=— Эл? Ло = S 


m n Q(n—1)s8-mn(z—0,? 1 ГаҚа-0)% m _ 20, (aL\* al 
QM Mel 20% m|| 08 0,17 3n | WM, ~ 203 


Now as the crude approximate region is given by 


т2— 01): 20: a) = 5 
n ta om) = № 


we may write in the next approximation 


т (п-1) 1 1 1 п(2— 01)? 
Т, = – ~ ge ИЕ а] 
37787 2 +30, 30, 30, (ж 0, | 


7-1 1 та-0,)% 8) 
= ee ER TA 1^ 
20 P aeg [1+ б, №], | 


obtaining as the next-order approximation for the confidence region 


т—0,)2 202 
EXP MM, @ 


where 7, is given by (8). As an obvious check 


on this result, we may note that if we had 
initially made use of the facts that z and s? 


are sufficient independent statistics for 0; 2” 


would be that the coefficient of T2 in (9) would be 263/(n—1), this difference arising from the 
neglect, justifiable to the order of approximatio | 


variance of T, due to the third-order moment corrections 


4. THE CASE or * NUISANCE PARAMETERS’ 
We come now to the more formidable problem of ‘nuis 


en» 
; ance parameters’. Т. ur тор! 
consider first the case of two parameters 0, and 0 parameters’. То fix our p. phe 


» With 0, the nuisance parameter. 
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quantity involving 01/00,, 0 L[00,, which provides an alternative to the maximum-likelihood 
estimate 0), is 9L 1.01 

00, 1,00," 
with variance Л, — 713/15. = ћу, say. But only in large samples, when the expression in (10) 
апа 8190, become uncorrelated normal variables, with 22/20, an effectively sufficient 
statistic for 0, can we expect the standardized variable 


_ OL 1,21; 
т-(%-1 8) [ ts (11) 


to be completely insensitive to the estimate we substitute for 0.. In smaller samples this 
substitution will affect the distribution of 7. In principle this effect can be successively 
allowed for, this being the proposed procedure; the effect of the correction at one stage may 
be reconsidered, a further correction of a higher order introduced, and so on. As a general 
method, this is, however, evidently going to become rather intractable after the first stage, 
for the previous advantages of working directly with the derivatives 01,/00,, 01100, are іп 
general lost. 

It is not possible to ignore the sufficiency properties, if any, that exist for 0,. For if 
21100, is equivalent, even in small samples, to a single sufficient statistic for 0, (when 0, is 
assumed known), we can* proceed from 7 by building up a polynomial series solution in 
91100, апа 0100, which is statistically independent of 27/20, and hence mathematically 
independent of 0,. But if 2/00, has not got such sufficiency properties, it can only be con- 
sidered ‘sufficient’ for 0, if regarded as a random function in the parameter 0,, and statistical 
independence merely of the random variable 21120, would appear to be no longer adequate. 
I thought at one stage that this difficulty might be overcome by eliminating also the 
dependence on higher derivatives such as 0?L/002, but this may be shown (by a counter- 
example, see 56) to be impossible in general. In the general case it seems therefore necessary 
to fall back on the very cumbersome substitution procedure referred to in the first paragraph 
above. 

Fortunately this substitution, if the maximum-likelihood estimate of 0, for given 0, is 
used, only affects the distribution of the statistic 7 to the order of approximation O(1/n), 
Where n indicates the sample size, a stage higher than the skewness correction, which is in 
general of O(1/,/z). For we may write the maximum-likelihood estimate of 0, (for given б,) 


(10) 


approximately as " aL а 
295, 22» 
ðL T 
80 that БЫА 1». (12) 


The estimated value of the second term in (12) is of course zero. Its sampling effect on Т, jn. 
however, be investigated. The first correcting term іп E((7;y) is 


aL oT . p [2L er" / | 
(тае |ы == 5(% 20, toa 


д 7| | Bt (2) | D с] , 9L д1, 
= oar la a) | o] Вт оз hs mu a 3 | ты). 


* (Added in proof. I do not mean to imply that we should, for in this case the probability of the 
Sample, given 22/20,, provides a distribution independent of 0, (cf. Bartlett, 1936), and the series 
expansion, illustrated by §5 on the ¢-test, is unnecessary except perhaps for finding approximations. 
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From the asymptotic independence of 7 and 81/00,, it is easy to see that all rie = 
vanish to the first order (for the second and third, we replace — 07/003 in the thir : не vd 
1„„ to the first order, and couple with the second term). Other asymptotically e eo 
estimates will be equivalent to the same order and may be used if more convenient; t 
effect at higher stages would have to be investigated separately. | 

For reference if required, the next term in the expansion of д0, gives 


oL oL E | ЕЗІ eL 
+ Ds 


вө, 002 1 202 | 208 7 20,) 905, (13) 
M T, 275, 
ab ar ar ere В (4) 
несе diu a.. ta+fag,+3\a0,) 20| > 


where / is the expression in curly brackets in (13). , 

A result of some interest which may be derived from an expansion like (13) is 
fı gf Л 2 - (19 
(E [oos] 59] | 79 


where the suffix 2 has been dropped for convenience, as (13) and (15) refer effectively 60 
a single unknown parameter. 


Ед) = 0— 


А al 
While it would be too laborious to carry through the complete details of the go 
correction to the O(1/n) stage, it may be advisable to indicate how this would proceed, 8 


that the iterative nature of the corrections is made clear. To О(1/ /т) we have 


S, = [T - y, (7? — 1)],, 
n A 44-4 f 
where y, is the skewness of 7’. Owing both to the properties of 8 and to the substitution ^ 
the estimate 5, for S, so that S, is related to S by an expansion similar to that for Z, in (14^ 


we have O(1/n) additions to the mean, variance, third and fourth cumulants of, 887^ 


Сі, Co, сз and c, respectively. Then a new quantity R correct to O(1[n) is (cf. Cornish & 
Fisher, 1937) 


Е-8,-с-1о, 8, — 10288 — 1) — паса (58 — 38, n 


the effect of further substitution for 0, in су, ср, с; and c, will not affect the distributio 
R, before at least O(1/ni). 


As noted in I, a statement of 


n of 


5. THE і-ТЕЗТ AND THE BEHRENS 
раат o о. demonstrate the Procedure first in the case of the standard 
t-test, even although not only do we not require it here b jt do 

i ut fi arksi 
not illustrate the completely general case. Tom the above rem 


From the expression (6) we obtain for the quantity Т defined in (11) 


-FISHER PROBLEM 


т 1—0) 


2 
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From the sufficiency property of 0/00, in this example, a successive elimination of the 
higher cumulants between 22/20, and the modified series 


n fos 9Г,\? oLoL CL \2 | 
(ата fat мы) (0) I | 20, 35,4] ны (33) In| ete 


will eliminate the dependence 017, on 0,. We choose the jv’s also so that Т, becomes normal. 
The resulting equations are similar to (5), being 


Kao + бр у = 0, ка +2 А = 0, Kis Sus = 0, 


OF flog = 0, ду = — бајт, оз = 0, во to the next approximation stage 
Am(& — 0,) 1 ((n — 1) s?-- n(z — 0,)? 
== Uy а де коны i. 
Т (1-5 n0, i) 


(та-0)) 
("= 1) ‚тщ IY. 


~ 


This result is not of course yet correct to О(1/т), but there is little point in continuing 
further with the corrections. If we make use of (12), we reach the same result to this order 
more simply. Higher approximations would merely develop the Fisher (1941) asymptotic 
expansion for t. 

Та the case of the Behrens-Fisher problem of the difference of two independent normal 
variables with separately unknown variances, we have 


l(r—04? m, lov, ms 17%, 
1 = – Пов (0+0) 5790. 2 ОЕ $ 3186-375. (17) 


where to correspond with the degrees of freedom n, n, for the estimated variances V3, Vo, 
the unknown variances of 2,, v, are taken as 0, and б, and 0, for the unknown true difference 
corresponding to ==, — 25. 
We easily deduce that I a —0, _ 
A(8, 4-03)? 

and if we substitute the most convenient estimates 21, v, for 0,, 0, (these are not the Strict 
maximum-likelihood estimates for given 3, but are asymptotically equivalent as ћу, Ny 
increase), we arrive at the usual approximate t-quantity (see, for example, Welch (1947)) 
To reach a result correct to O(1/n) we should require (i) to correct the kurtosis of T (its 
skewness is obviously zero), (ii) to make the O(1/n) correction corresponding to the effect 
of the substitution of фу, v, for 0), 0,. There is little point in carrying this through, for it is 
clear that it would lead to the next order correction in the asymptotic series solution to this 
Problem proposed by Welch (1947). My impression (with which Dr Welch may not 
necessarily agree) is that his remarkably ingenious solution to this problem may be logically 
Classified with the further asymptotic expansions based on the normal approximation which 
are the subject-matter of I and the present paper. 

If we alternatively tried to extend the first procedure illustrated for the t-test, to cover 
two nuisance parameters 0; and 0з, we should eliminate the dependence on both 01100 
91190,. It has been verified to the first stage of the approximation procedure that this leads 
similarly to a consideration of the approximate t-quantity (a — 9.) (v4 +v), but we would 
not expect it to extend to higher stages for the reasons given in $4, as 0100, and 27,00, do 
not correspond to а pair of sufficient statistics for 0, and 0, (for given 03). | 


апа 
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6. FISHER’S ANALYSIS OF VARIANCE PROBLEM 


А simpler problem for which such difficulties may be illustrated is the second problem Cons 
sidered by Fisher (1935) in his first derivation of solutions in terms of simultaneous fiducial 
probability. This problem may be stated for our present purpose as follows. We have two 
X?-quantities "i fiada 
0 A0, 05” 
with degrees of freedom т and mp, and require a confidence interval for 0,. Here 
т _% v т Ny Va 
Lm ios Sg) ots 2 
— L(n) 4- L(n;), say, 
oL n 
88, ^ 2(0 + AG, pU - (a+ Аба), 
9L _ ƏL OL(nj) 
00, '00,' 00, ' 


NM (WES = EET п 
іше 2(0, + А0„)?? hoch, Tyg = A hi gr. 


whence after reduction we obtain 


mW E а. МЕ 8 
па и / 3[n,(0, + Аб,)#-+ Ар um 

While this problem appears at first sight rather simple, depending on two joint sufficient 
statistics v and v, for 0, and 0,, the difficulty arises from the absence of a sufficient statistic 
for 0; (for given 0,). Hence we cannot expect it to suffice to modify T to be independent öf 
01100, In this example we may notice further that it is impossible to make T independent 
also of, say, 22/208, for as Т, 2100, and 0%100% are all linear in v and v,, there is a linea" 
relation between them. Thus we return to the substitution procedure дөдбініндейікі in $4 

It will also be seen in this example that (as in the Behrens-Fisher roblem) the use of the 
normal approximation depends on two sample numbers not bein, Es mall, in this 0486 
n and ng. The best estimate of 0, for given 0, isnot Фу, but can be diva x n ‘a e equation 
21100, = 0 using v, as a first approximation. This leads to the sisse к= 


na An(v—4,)) [[m. An, 
reso eta] W 


We may not substitute the simpler estimate % (unless n, > n) 
the same order as the estimate in (19), it is not as a e 
Page . Э ymptotieally 1 pene 
the effect of the substitution will be of O(1/An, 1] dad "n А Fa пина of T and p" 

first stage of the correction. St also be corrected even 

The skewness of 7, unaffected to the first order by the sub tituti 
stitution, is 
0. 

у=8 Cat A 26 " 
ті ue 


(20) 


where Л.з is the expression under the Square root in (18) 


+; of 
› for though its accuracy 18 x 


= 
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Thus corrected to O(1/,/7, Мт) we may consider our confidence interval for 0, given 
by either of the two equations (сЁ. I) 
Т-фу(7%-1)- £y, (21a) 
T—iy608—1) = д, (216) 
where in (21а) and (210) it is understood that the estimate in (19) is written for 0.. An 
equation correct to O(1/n, 1/n,) would be rather complicated, for in addition to a correction 
for the substitution for 0, a correction for kurtosis would be required; this would make the 
total correction exceedingly cumbersome, as a bias, variance correction and the asymmetry 
must all be allowed for at this stage (see 54). 
However, from the check made in I on the accuracy of the confidence interval for a single 


variance, the use of equation (21а) or (216) would be not unreasonable for moderate-sized 
n and ng. 


Table 1 
= а X Lower and upper 
n Y quantities 0. v » | P = 0:05 limits for 0, 
ENTER = = — À 8 
e E. р = | 
0 0-980 0 0-55 (0-56), 2-32 (9-95) 
12 11-76, 13-43 2 1:960 А 2-240 | 0, 3:53 
7 { 4 2-940 4-476 0, 4-74 
0 0 0-72 (0-72), 3-00 (2-92) 
i52 bun 2 0-910 0-91, 5-59 
4 1:820 0-86, 8-06 
0 0-308 0 0:14 (0-15), 1-28 (1:13) 
1:85, 7-80 2 0-617 2-600 0, 1:16 
9 a { 4 0-926 5-196 0, 1-15 
0 0-940 0 0-44 (0-45), 3-92 (3-46) 
5:64, 4-18 2 1-880 1:583 ‚ 7-09 
4 2-820 3-167 0, 10-32 


To illustrate and examine the numerical use of equation (216), it was applied to a few 
artificial sampling estimates in the case when there are only two observations per group, so 
that in the analysis of variance table the between-and-within-groups have an equal number 
of degrees of freedom, т. The true mean square between groups is 20, + 0, so to correspond 
to the notation above (with A = 4) we take as v one-half of the between-groups mean square 
and v, as ће within-groups mean square. Two cases were taken for n = 12, and two for 
n = 6.* In addition, the same effective Y? quantities were used when comparing the results 
for two different values of 0, (2 and 4), so that the extent to which the inference depended 
9n @, could be more easily seen. On this last point, however, it should be recalled that the 
Statistic used is only independent of our error in estimating 0, up to and including O(1 Idm) in 
distribution, and in any particular case (for which the error in 0, will alter with 0 
the same x2) will still change to an extent of O(1 Nn). 

oreover, the information on 0, is of course less as 0, increases, so the width of the 
interval should in any case tend to increase with 0,. For comparison the case 0, 


2 even for 


= Ohas been 

* It had originally been intended to examine ten cases each, but as this would still be far from 
Suffieient to test the approximation adequately, it was later thought hardly worth while. The sample 
quantities v and v, made use of were partly selected from the larger sot as interesting combinations to 
try out, and thus are not entirely random. 
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included, as this reduces to the single variance problem discussed in I, and the exaob 
confidence interval in this case can also be given in Table 1 (in brackets). The appreximate 
intervals for 6,+ 0 were obtained from equation (215) graphically, plotting against 0, for 
the lower limit and 1/0, for the upper limit (these scales are indicated by the form of x 
equation at the permissible extreme points 0, — 0 and 1/0, — 0). The true value of 0, through- 
out was unity. 

The lower limit for 0, will often be the minimum permissible value zero, rather than the 
value corresponding to д = — 1:645, which may well not be reached, especially for small 
n and large 0,. This is not unexpected, as it is well known that even the strict maximum- 
likelihood estimate v— Av, in this problem may often turn out to be negative; it implies, 
however, that we are not making full use of the 0-05 risk of error at the lower value, so that 
it seems useful to think of the error risks for the lower and upper limits separately.* 


7. APPLICATION TO TIME-SERIES ANALYSIS 


16 has previously been noted (Bartlett, 1952, at end of paper) that the technique шау 
be applied to unknown parameters іп time-series. We are now in a position to consider 
further the solution proposed there for the auto-correlation of a normal Markoff process» 
in discrete time, and to give a modified solution corrected to O(1/,/n). The quantity 2 18» 
apart from the end-correction (which will not affect our solution until O(1/n)) 


lèy 22 
L=-5 logo} -> X Yyot, (22) 
2 9. 
where Х,- ВХ, = F. 


(For simplicity we assume, as before, that E 


{У} = 0. In (22) the observation X, has been 
discarded.) We have} 


oL _ 
ap = XS PX a) X, Joy, 


n n 
ta = тој 1 = 0, = 

where 0, =, 0,=0%,. We have further 

L Ж Oh, nf 

(P ^C др = 
whence Е (4) | _ 6nf 

of, a- 
We note finally that 6% = У X3(1— f)In. (23) 
т=1 


Hence our approximate confidence interval for f is determined from the equation 
"n 
AS-A 0, " 
moms - Ad" ЖИН 
Мега — 25) Ма Рај =. ( 


TET (21) at 0, = 0 is (Ао v-- Авр), which must be great?” 
than / for a non-zero wee limit. When Av,/v < $, ib may ae | Раље ан relation W? h 
0, breaks down near 0, = 0, but of course if 0, is near zero dv, /v~ 1 +0(1/4/п), so that no difficult 
should arise within the ‘admissible’ range of 0, except perha; T у ТЕ 

T In equation (25) of the paper cited, o% уу; ps for very small n. 


as i Р -— ct the 
deduced confidence interval equation (26). 8 inadvertently omitted, but this did not affe 
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It is interesting to notice the appearance of a non-zero skewness correction, in contrast with 
the classical regression problem. 

When we are able to assume normal residuals (or some other complete specification of the 
process) similar methods can clearly be applied in other time-series problems. 


8. RELATION WITH GOODNESS OF FIT TESTS 


It was pointed out in $3 that an alternative to the approximate y? expression in 0100, 
would be (о use —2(L— i where Lmax, denotes the maximum of L for choice of the 
unknown 0,. This last expression is more familiar in the use of the X? test as a goodness of 
fit of hypothetical values of the 0,, but the two uses are of course ye connected. In this 
final section we shall demonstrate to the next order of approximation (O(1 (1/4n)) the asymptotic 
equivalence (in distribution) of this alternative 1? approximation,* equivalent to the 
likelihood ratio A and given by 


—2]logA = —32(L — Lmax.)- (25) 
Whether we make use of A or the general quadratic formula 
s roau aL 
у ~ 19 20; 20; > (26) 


where in (26) 19 denotes the inverse of J, ;j (the summation convention is used throughout 
this section), will thus to this order be a matter of convenience. Formula (25) is formally 
simpler than (26), but involves finding the simultaneous maximum-likelihood estimates д. 
We have (expanding 146, ;) about L(0;), rather than, as is perhaps more usual, L(0;) about 
140.) 


146.) = 10) + (0,0) 55+ WO. 0, (0,— кл => 
+40,- 0) бр 0,— 00 ag Fag > (27) 
t 7 4 


The relation between 0, апа С may be developed analogously to the case of one unknown 
quoted in §4 (equation (13)). We have 


aL oL 


0= + -0) a7 t Ê- – 93) 6.6.) обв. +- 


дг ?L BL 
-Ш. (0-01 „+(6- -0) (+) +86, —-0)(д,-— RN 


Denote the inverse of I; by 19. Then 


һәр, [9L [| 9L ] ƏLƏL PL 
I., | I" Iik [tk Thi Тат 777. 
бъ der a+ |9, 20,20) 9 ASTE 00, 00, 90,80,90,, || 


= то (28) 


oL eL hj Jik |. Atk] Гот — aL д1, L 
J |, Ес ui 00,00, 00,00,00,, |+-- 


* Fora discussion of the large-sample x? approximation for the likelihood ratio А (including the more 
Beneral сазе where L is replaced by its maximum Грах, With respect to a smaller subclass of the un- 

own parameters), see Wilks (1938), where it is shown that —2 log A is equivalent to a X? except for 
terms of О(1//һ). 


Say, where 
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ND „ав ee 1) 
—2(L—Dmax) = Тө. 98; [ag 36,1” 36, (8626; + 4 


ӘБ, @% ЖЕ ль. 85 95 
әк м 1 [kt 19 ~ [um . A — 
кен ей да * 90,1 09, 1^" 90, 90,20,98, 


ее 


т 


or if we put 021/00,00, ~ —J,; in the fourth term on the right, this may be written 


in OL oL аг дг ikJih eL ij loLoLoL PL ppp a... 
I 59,00, (6 20, ( I^ 58,00, +1") + 300,00, 00, 90.20.20. | 
ETT б) 
= ЖИГ ==. 
Petr 


say. Now as the skewness of 27/20), does not affect the distribution of the quadratic first 
term, it is, even to O(1/ n), a valid y? quantity, as we saw in $3. It remains to consider the 
effect of the remainder term R. Now although R is of O(1/,/n), we can readily show that 
ЕЕ), and E(I'*0L[00,0L[00; В} are of O(1/n), so that — 2(L — Lmax.) has the same mean and 
variance as 11%01|00,01,20;%о O(1/,/n). The same method can be applied also for the higher 
moments. То this order the same y? approximation may thus be used. Although these 
average values involve higher-order moments of L derivatives than we have listed, for 007 
immediate purpose we do not require their explicit values. We merely note that 


aL ab ( ә 
Ел (5,25, + ај < O(n), 


(60222. a eios 
90; 50, 90, 60,00, | vn), 


ƏLL\ ƏL oL ү BL 
" | (is 28) 00, 00,, (е 00, * in) | < O(n’), 


ЕЕЕ авар BL А 
00, 20,) 00,0, d0, 50,20, = < O(n’), 


and as I“ is O(n), the result follows. It is obvious from particular examples that 
—2(L— Laus.) does require modification to O(1/n) (so also does the alternative quadratio 
expression). In principle a general correction to O(1/n) could be worked out by the above 
methods, though we have seen that corrections at this order are rather formidable t° 
evaluate by the present general approach. Particular problems, e.g. in time-series analysis: 
might be somewhat more tractable if handled individually. 


By an obvious extension of the argument, the criterion — 9(1/ — —L ), where Газах, 
refers to a smaller subclass of the unknov ual ан 


H is- 
д vn parameters than Lmax, is equivalent iD di 
tribution to a Х2, also up to and including O(1/,/n). max. q 


I am indebted to Mrs A. Linnert for assistance with the computations for Table 1. 
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NON-NORMALITY AND TESTS ON VARIANCES 
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Imperial Chemical Industries, Dyestuffs Division Headquarters, Blackley, Manchester, 
and University of North Carolina 


1. INTRODUCTION 


For many experimenters the most commonly used statistical tests are those for comparing 
sample means and sample variances. The test used for the equality of the means of k groups 
of observations is usually the analysis of variance test (or equivalently the ż test when Ё = 2» 
whilst the test used for the equality of variances of k groups of observations is Bartlett's 
(1937) modification of the Neyman-Pearson (1931) L, test (or the equivalent ЈУ test when 
k = 9). 

The tests mentioned are derived on a number of assumptions, in particular, that the 
observations are normally distributed. Usually, however, since little is known of the 
populations from which the samples are drawn, these tests are used, of necessity, 25 if the 
assumption of normality could be ignored. So far as compar: 
concerned it appears (perhaps rather surprisingly) 
for thanks to the work of Pearson (1931), Bartlett (1935), Geary (1947), Gayen (19504 n 
David & Johnson (1951 a,b) there is abundant evidence that these comparative tests 00 
means are remarkably insensitive to general* non-normality of the parent population. 


It would appear, however, that this remarkable property of ‘robustness’ to non-normalit 
which these tests for comparing means possess, and with 


қ s 
i hout which they would be much ve 
appropriate to the needs of the experimenter, is not nec 


tests, and in particular is not shared by the tests for equal 
The sensitivity to non-normalit 


: are 
ative tests on means Т 
that this practice is largely justifiable» 


med by Geary (1947), Finch (1950) 
tis particularly sensitive to chang" 
ation у, у, ete., will be used for a 
otation y, = Ва, у» = 6, —3-) In " 


It is further shown that the ormality. 


nd 
test on means on the one Ра 


0 
. . ге 
ап essential difference in the natu! 


Е ggests ho NIA lit 
of the variance tests can be Süd Ма ен, № чиш. 


* By ‘general’ parent non-normality i " 
В f : à У is meant that th ity, i Қ” 
с abet ер different groups, as could е бейне ex нен Rho Din wee гей 
I Е in which sample me groups corresponded with different а грн inam or отр | 
а ст 1р ans аге compared, general ske "o pee s cde „ 
are found, however, if the Skewness is in different di muss ie fobs a d 


Irections in the different groups. 


— nó a—?€—XX 


Ин 
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2. DISTRIBUTION OF VARIANCE TEST FOR LARGE SAMPLES 


Suppose there are / groups of independently distributed observations with т, in the tth 
group and X m = N. The ith observation of the ith group is denoted by y; and the usual 


estimate of variance in the (th group, having фу = n,— 1 degrees of freedom, by 57. The average 
of the estimated variances is denoted by s?, where s? = У физ? /Ф and Ф is the total number 
of degrees of freedom У, = Ф. АГ 

The L, criterion due to Neyman & Pearson (1931) tests the hypothesis that the 
variances in the groups are all equal, given that the observations are normally distributed. 
Alternatively, M, the logarithmic modification due to Bartlett (1937), may be employed: 


М, = Olns*— Y; ¢,In 57. (1) 
t 


Following Neyman & Pearson (1931) it may be shown that when the appropriate null 
hypothesis is true and on the assumption of parent normality, JM, is distributed in large 
Samples as yz. , (where үҙ , means x? with 2-1 degrees of freedom). This is the basis of the 
test proposed by Bartlett (1937). He shows that, even for fairly small samples, to a close 
Approximation M, is distributed as (1+ А) yz 4, where A, an adjustable constant which 
tends to zero for large values of ¢,, is given by 


1 Ты 
4- жср Fa a): (2) 
It is convenient to define the quantity | 
1 @ = 
4-(: ы») 8 (3) 
When ф tends to infinity this becomes 
b= (1+ @ 


Thus д is an alternative measure of kurtosis and the quantities В» Уә and 6 are related as 
follows: 


Bs 1 2 3 6 © 
Yo => = 0 3 = 
5 со 2 1 0-4 0 


Denote by 83 (уз) an estimate of variance having ø degrees of freedom and based on 
Observations drawn from a population with finite cumulants whose measure of kurtosis is 
Ye. Then 82(0) is an estimate from a normal population. The estimate 82(у») has mean с? 
nd variance 20%/(4ф). Also from the general expressions for the higher eumulants of such 
ап estimate it is seen that as ф tends to infinity the distribution of 8%(у) tends to normality, 
and for sufficiently large values of ¢, (уз) is distributed like 843(0). 

| Denote by M,(0; ф, Фо, «++» Pr) the criterion calculated from normally distributed observa- 
tions and by My (узу Узв +++ Yoni Фа фз... Px) the criterion calculated from Ё groups of obser- 
Vations drawn from populations with measures of kurtosis y,,, Уә» +++» Уш... Yop. Then for 


la; 
rge samples My Уго +++ Yori Фив» --- Фр) 
is distributed as Ф (X: фе, (0/0) — 2) фа 88.800). 5 
t (5) 
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When the samples are sufficiently large we may write 6, for 4), and in the important = 
where the kurtosis in each of the sampled populations is the same, so that д, = 6 for all t, 
M, (723 Фф». --- фу) is distributed as 


5-Ң2Фіп4%,0)- > 3d, In «8, (0)} = dL (0; 644, дб... дё). (6) 
t 


Consequently №, (Уз) is distributed asymptotically not as y2_, but as (1 + 15) М1 for es 
parent distribution having finite cumulants. We see therefore that M, is asymptotical 
biased if yis not zero. In particular, in large samples the mean of the distribution curve к 
М, would be (1 + 472) (#— 1) instead of & — 1 and the standard deviation (1 + РЕЛЕ n 
instead of {2(k—1)}*. The discrepancy in means relative to the standard deviation woul 
thus become larger as k was increased. Consequently divergences from normal theory 
probabilities, known to be large when Ё = 2, would become progressively larger for > 5 
Table 1 shows the true probability of exceeding the normal-theory 5 % significance levels № 
large samples for various values of Ё and у, when the null-hypothesis is true. 


Table 1. True percentage chance of exceeding 5% normal-theory significance level of DA 
in large samples from non-normal populations for various values of ys 


No. of groups = k 
Y2 
2 3 5 10 20 30 
2 16-6 22-4 81-5 48-9 71-8 84-9 
1 11-0 13-6 17-6 25-7 38-9 49:8 
0 5:0 50 5-0 5-0 5:0 5-0 
EN 0-56 0-25 0-08 0-010 0-0004 0-00001 


: . d 
platykurtie populations real differences will te” 
to be masked. 

The behaviour of the test for means is in sharp contrast to that for variances discussed 
above. The appropriate test for the hypothesis—that the means in the groups are all equa” 
given that the variances are all equal, and that the observations are normal ly distributed 
is the within and between groups’ analysis of variance test. In its commonest form th 
criterion used is s 


F = (between groups mean square)/(within 


== > BP fo. 


t 


groups mean square) 


Alternatively, to correspond wit m 
aries а тайы pond with М, we could employ a form obtained from the 10521 
Ja = N- In (Le (— 1) jay. gy 
For large samples Neyman & Pearson (1931) showed that assuming normality a criterio? 


equivalent to M, would be distributed as xè. Wit р pct 
the parent population it is possible to sh ki h certain not very severe restrict 7 


3 oni 
9% that this result is true whether the population 


рт 
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normal or not. Thus for large samples the criterion M, for means tends to follow the normal- 
theory distribution under conditions of non-normality. This explains to some extent the 
insensitivity to non-normality found by Pearson and other workers mentioned above. In 
contrast, the criterion M, for variances follows a distribution directly dependent on y, 
however large are the samples. 

It is perhaps unfortunate that the test for homogeneity of means, and the test for homo- 
geneity of variances in the particular case of two groups, can each be brought to the form of 
à variance ratio test. This has sometimes led to apparently contradictory statements about 
the sensitivity of the ‘variance ratio’ to non-normality. As originally pointed out by Pearson 
in 1931, the two criteria are really essentially different in character but follow the same 
distribution when the paient is normal. The marked difference in sensitivity of the criteria 
is well brought out by an example taken from Gayen’s work (1950а). A test for homogeneity 
of means of five groups of five observations leads to an F test on 4 and 20 degrees of freedom; 
on the normal assumption this will be distributed in the same form as the F criterion used 
to compare the variances in two independent groups of 5 and 21 observations. Gayen shows, 
for example, that for a certain type of non-normal distribution in which y, = 0 and уз == 
the true probability of exceeding the 5% point for the test on means would be 4-5 po 
(à quite trivial discrepancy), whilst that for the test on variances would be 10-2 a 


3. DISTRIBUTION or M, IN SMALL SAMPLES FOR CERTAIN 
NON-NORMAL PARENT DISTRIBUTIONS 


It will now be shown that parent distributions exist for which relations similar to (6) are 
Obtained even for small samples. 


(3:1) Population means known 


For the moment it will be assumed that the means of the Ё sampled populations are 
known, Denote by s?, an estimate of variance having я degrees of freedom calculated from 
the deviations from the known group mean 7. The criterion calculated from such estimates 
is denoted by 1. If the distribution of the observations is such that 5° (Уз) is distributed 

ike 83 (0) for any sample size then 752, (ys) is distributed like -Iyin o, which implies that 
(y~1)? is distributed like 0-202. This is true if 

PY) = (8/252) (T (39)) | y— [> exp {—d(y—9)?/207} (-о<у< +оо; 0<б< oo). (7) 
This is a double y distribution, that is to say, it is a symmetrical distribution of the form of 
two y distributions having 6 degrees of freedom ‘back to back’. It has mean 7, variance g?, 
У = 0 and у, = 2(1-6)/6 (that is, 8-1 = 1--%у; as previously defined). When б = 1 the 
term in Iy-79 | vanishes and the distribution is normal. When 6 is less than 1, у; is greater 
than zero, and a leptokurtic distribution results. When 6 is greater than 1, У» is less than 
Zero and a bimodal platykurtic distribution is obtained. For such parent distributions it is 
easily seen by the previous argument that 

М (уз; My, ts; -+--> ng) 15 distributed exactly as 07M (0; O71, Ong, ..., дт), (8) 
Where Ny, Ng, ..., Mp refer to the sample sizes. 
(3-2) Population means unknown 


In 1931 Le Roux carried out a very painstaking research into the distribution of the 
Variance calculate d from small samples drawn from non-normal distributions of Pearson 
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type. He studied changes in the distribution of s? as the parent distribution changed, by 
calculating its first four moments from the moments of the parent distribution, and Солу 
firmed his findings by extensive sampling experiments. One of his discoveries was that 1n 
addition to the normal distribution a series of non-normal distributions existed (which he 
called D(s?) III distributions) for which the first four moments of s? were almost exactly 
those of a Pearson type III curve. These D(s?) ПІ distributions show kurtosis accompanied 
by marked skewness. A selection of these adapted from Le Roux's table are shown below. 


D(s*) ПІ distributions 
yi 0-00 0-10 0-20 0-40 0-60 0-80 1-00 1-20 1-40 
7. 000 -004 003 094 048 0-73 0-99 195 1-51 
For such curves Le Roux deduced that to a close approximation 82(у») was distributed 48 


82(0) for any sample size (where as before 47! = 1+ 5 75 7.) „а conclusion supported by 


his sampling experiments. We may conclude therefore that the relation (5) is approx 
mately true for samples of any size chosen from such populations. 

Tn particular, if the degree of kurtosis is the same in each of the populations sampled and 
each variance is based on the same number ф of degrees of freedom, then for D(s?) п 


populations 
Му; $, $, ..., $) is distributed approximately as 4-11/.(0; dd, dd, ..., do). 


Using Bartlett's method wesee that for D(s?) ПІ populations therefore M, (yaido Po? dx) 
is distributed approximately not as (1 + А) Ха but as й 


4-1 Ad3)j p 
where A is the correcting factor of order $^ defined in (2) 


from normality of this form would not need to be ver 
before the effect of the constant 4-1 completely swampe 


jure 
- We note in passing that departut 
У great or the sample size very 
d the effect of the correction term 


4. Two GROUPS or OBSERVATIONS 
When k = 2 we may use the variance ratio F as an alternative form of the M, oriterio"" 
Using the same argument as before and a c 


: r 
- orresponding notation it is apparent that 19 
samples of any size drawn from double X distributions with measures of oda You and ya” 


Pra Уз; 14, Mg) is distributed exactly as Ё(0; à n, б.п ) (10) 
i ? 19272); 
whilst for samples of any size from D(s*) ITI populations 
TY у»; $1; d.) is distributed approximately аз F(0; 4 фу d ds) (М) 
These results, which are, of course, equi › 2191, 05 2 | 


11) and the more exact values caleu 
hes considered the distribution of the "чи 2. P in (1950 b) and ar ie 
populations defined by the first few terms of Bageworth's ; ie equivalent z = ae 
that for these populations the effect of changes in y, is м Both these ч зо ай 
produced by changes in y,. In Fig. 1, Ут İS assumed ану in. lois йке т. жет" n 


ed by these authors, of exceeding the normal theory 5 % ро? 


lated by oth 


e о 
se ыы Su 
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for various values of у,. The values of y, are assumed to be the same in each of the two 
populations. The first graph illustrates the case фу = 24, ф, = 60 considered by Finch, and 
the second the case ¢, = 4, à; = 20 considered by Gayen. The dotted lines are the values 
given by equation (11) above. A further comparison is supplied by Pearson’s (1931) sampling 
experiments. Pearson drew 500 pairs of samples of 5 and 20 from six experimental popula- 
tions and calculated a criterion equivalent to Ј; this corresponds very nearly to the case 
ф, = 4, фу = 20 above (Ф, is actually 19 instead of 20). In his table Pearson shows the number 
of pairs of samples for which the criterion exceeds the 4-86 % point (which again is very 
close to the 5 % point considered above). These numbers reduced to percentages are plotted 
as circles about the second line. 


11 11 
10 | 10 
-24 %,-60 
9 Ф, 2 9 


со 
со 


и 7 
6 6 
РФ T 
5 5 
4 4 
3 3 
2 
: # = Gayen 
1 — Finch 1 # --- Equation (11) 
т-- Equation (11) “ © Реагзоп'5 sampling experiments 
Se eS MÀ d ee, 
- 0 1 -1 0 1 z 
у Scale of Y, Scale of Y, 


Fig. 1. Percentage probability Р of exceeding normal-theory 5 % level plotted against value of y,. 


The agreement between the simple approximation (11) and the values found by Finch 
is very close, as would be expected for the comparatively large sample sizes used, The extent 
of agreement with Gayen’s results is rather surprising when the small size of one of the sam- 
Ples is remembered. In both approaches it is assumed, however, that the higher cumulants 
are finite. The marked departure from both lines which occurs when this is not so is shown 

У Pearson’s sampling experiment from the type VII (i.e. the Student t) distribution for 
which у, = 4-1 and higher cumulants are infinite, 


B. SAMPLING EXPERIMENTS WITH A D(s*) ПТ DISTRIBUTION 
Confirmation of (9) in the case k = 20 was obtained from an experiment in which samples 
Were drawn from a population nearly of type D(s?) III form. Le Roux drew 1000 samples 
Of n = 90 observations from such a distribution in which y? = 1-0 and Ys = 0-8. (The dis- 
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tribution was actually a Pearson type I.) He calculated the quantity (п — 1) s?/n for each of 
the 1000 samples and published the resulting frequency distribution. The samples were 
grouped but the grouping interval was fairly fine. In the present work Le Roux’s observed 
distribution of s? has been resampled, the sampling being without replacement. In each 
experiment twenty-five groups of twenty variance estimates were reconstructed and from 
these, twenty-five values of M, were calculated. The experiment was then repeated using 
Le Roux’s observed distribution of variance estimates for samples of n = 5 from the same 
parent distribution. 


Table 2. Characteristics of observed distributions of twenty-five values of M, for k = 20 groups: 
with (а) n = 20 observations and (b) m = 5 observations, drawn from а Pearson type 1 
parent distribution for which уз = 1-0, y, = 0-8 


И mE 
| 
Expected on Қ Expeeted from 
normal theory | . Found in а tion (9) 
(x? approxi- samples of 25 (x? approxi- 
| mation) values of M, ы mation) 
т а ты | 
1 distributed as: | 1-018y2, 1-415 yj, 
үш 19-4 28-6 + 1-9* 2... 
Variance 39-4 93-3 + 30-37 76-1 
п = 20 4 = — ft |-—_ а = по === i 
Р 
No. out of 25 |19 те 13 (52%) 10-3 (41-2 %) 
significant p 9 (36 %) 7-5 (30-0 %) 
at P % point | 0:25 4 (16%) 3-2 (12-8 %) 
0-1 0-025 
Б 1 (4%) 0-9 (36%) 
M, distributed as; 1-088? | Ру 
Mean 20-7 es Я 14783 
Variance 44-9 280% L8* 28-0 
— 79-1 + 25-61 82-4 
: абы tees 
10 . 
No. out of 25 | = г 4 13 (59%) 9-6 (38:2 96) 
significant 1 QUE 9 (36 9%) 6-8 (27-1 %) 
at P % point | 0-1 0-025 2 (8%) 2-8 (11-0 %) 
0 (0%) 0-7 (2-9%) 


* The quantity following the + 
s.E.(Mean M,)={V(M,)/25}!. 


signs is the approximate standard error of the mean obtained from 
+ The quantity following the + 


The two observed distributions of M, are 
table in the Appendix. The main features oft tribut: in Table > 
It will be seen that the means and variances of the values of p e "и dee: sii with 

1 ате 1N close agreeme 
edly from those expecte 
fied. For example, both 


8, nin es 
° out of twenty-five (or 36 %) of the val” 


were significant at the 5 % point, 
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6. SAMPLING EXPERIMENTS WITH OTHER DISTRIBUTIONS 
Besides the population of the D(s*) ІШ type, Le Roux considered a variety of other popula- 
tions, mostly using sampling results used by E. 5. Pearson in earlier investigations, to 
determine empirically the distribution of the sample variance. In the present investigation 
Le Roux’s distributions derived from the following parent distributions have been resampled 
in the manner already described: 


Pearson typo и ПІ І ту VII 
(vi, Ya)* (0-0, — 0-5) (0:5. 0-75) (1-0, 0:8) (1-2, 2-8) (0-0, 4-1) 


From each distribution, twenty-five sets of twenty samples of 8? have been obtained, and 
for these twenty-five values of Jf, calculated. These observed values of M, are set out 
in the Appendix. Table 3 shows the values of the means and variances of M, observed 
and those expected on normal theory, together with the number of values exceeding the 
normal theory significance points. 

It is seen that large discrepancies from the normal theory distributions occur, and the 
discrepancies are in the directions expected. The discrepancies are larger when n = 20 than 
When т = 5, and for the populations studied the asymptotic result seems to give an upper 
limit to the diserepancy which is approached as the sample size is increased. The asymptotic 
result appears to be more rapidly approached when kurtosis is accompanied by skewness 
till for the skew (82) ITI populations, the asymptotic value is attained even for small sample 
Sizes, The asymptotic result is approached more slowly when kurtosis is large and for the 
very leptokurtie type IV and type VII distributions, sample sizes greater than twenty 
appear to be needed before the asymptotic values are closely approached. 


7. М, А8 А TEST FOR NORMALITY 


The seriousness of the discrepancies to be expected, even in small samples, may be арргесі- 
ated by considering JM, as a test for normality. 

Suppose a sample of N observations is drawn from a population distributed in the double 
X form of equation (7). We shall assume that the mean 7 of the population is known, so that 
Without loss of generality we may take it to be zero. We wish to test the hypothesis Я, that 
the distribution is normal, i.e. that ô = 1, 07 0, against the class of alternatives H, which 
Specify that the distribution is not normal, i.e. that ô+ 1, с> 0. Then, following through the 
method set out by Neyman & Pearson (1933) for the selection of best critical regions, it is 
easy to show that the appropriate criterion is Му = NIlnn(Xy?|N) —EIny?, and that if 
M a) and М ,(1—«) are the с: and 1 —« significance points for M ү, then the inequalities 

M,»M;,(x and М,<М (1—–а) (12) 

define a pair of common best critical regions for testing H, against the alternative hypotheses 
8-1 (у; >0) and ô> 1 (yc < 0), respectively, where in each case ø is unknown. Thus in the 
Particular case when there is only one observation per group, М. , is itself the most sensitive 
Criterion possible for testing for a certain type of non-normality. Non-normal distributions 
Of the class considered, which are defined by equation (7), would rarely be expected to 
Approximate to the distribution of actual data, but the above investigation suggests that 
We may form some idea of the sensitivity to non-normality of M , for small samples by seeing 
how good it is as a test for non-normality. 


* The actual standardized cumulants given above differ slightly in some cases from the theoretical 
Values due to the finite size (10,000) of the populations which Le Roux sampled. 
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(7:1) Comparison of M , with criteria for testing normality 
The usual test criterion for kurtosis is bs, the sample estimate of the moment coefficient 
В». Geary (1935a) proposed as a new test for kurtosis the criterion 


a = (sample mean deviation)/(sample standard deviation). 


This criterion had the advantage that unlike б», the distribution when the null-hypothesis 
of normality was true could be found fairly readily. Further investigation was carried out 
by Geary (19355, 1947) to discover whether the power of this test to pick out a departure 
from normality was comparable to that of 5,. Pearson (1935) studied this question by means 
of sampling experiments. One such experiment consisted of drawing ten samples each of 
seventy-six observations from each of three symmetrical populations: 


Description у yt 
Rectangular 0-0 —_й® 
Pearson type ҮП 0-0 Ре 
Double exponential 0-0 2-9 


* As before the actual standardized cumulants given above differ slightly from the theoretical values 
for the last two populations due to the finite size (10,000) of the population sampled. 


Pearson used his results to show the relative frequency with which the test criteria b; 
and a detected departures from normality and also to show the correlation between the 
results of the two tests. He published not only his conclusions but also the actual samples 
which he used and І have used these again to calculate the corresponding values for M ;. 
In my calculation of М; І have used the population mean (zero in each case), whereas in 
Pearson's calculations of б and а the sample mean was used. In samples of seventy-six, 
however, the discrepancy this introduces should not be large. A further difficulty arises 
due to the fact that the samples are grouped to the nearest unit, resulting in a frequency 
class having zero deviation from the mean which would give rise to an infinite value of M ,. 
To overcome this I have spaced the values evenly in the interval 0-0-0-5. Thus if there were 
% values in the zero frequency class these have been assumed to be at 1/(4n), 3/(4n), ..., 
(2n — 1)/(4n). Fig. 2a shows the thirty values of Му plotted against the corresponding 
values of b, and Fig. 2b shows the same values plotted against the values of a. It is seen that 
in each case marked correlation occurs, and if these graphs are compared with Pearson's 
Fig. 4 in which he plots а against b, it will be seen that the extent of correlation between 
М | and b, and between М ү and a is similar to that between а and 5. So far as is known, по 
tables are available for the significance points of М, (or for the equivalent criterion Та) for 
the case of seventy-six groups with variance estimates each based on 1 degree of freedom. 
Also the asymptotic approximations of Bartlett (1937), Hartley (1940) and Box (1949) 
break down for large k and small numbers of degrees of freedom. Approximate significance 
Points for M , were therefore calculated by considering the distribution of L71 = exp {M/k} 
Which ranges from 1 to co. General expressions for the moments of L, were given by 

eyman & Pearson (1931). From these the first four moments of L7 were calculated and 
the appropriate Pearson type curve to approximate the distribution was found to be of 
type VI. It was assumed therefore that Ly! was approximately distributed as сЕ, gy the 
Curve starting at 1. By equating the first three moments of the two curves the values , 
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Fig. 20. Comparison of M „у and а in samples of seventy-six. 
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с = 2-4705, фу = 67-1998, ф, = 45-2904 were obtained, and the. approximate significance 
points calculated by two-way, four-point harmonic interpolation in the Ғ tables of Mer- 
rington & Thompson (1943). 'The approximate significance points thus found were: 


Lower Upper 


64-1 12-3 121-2 133-3 


Table 4 is an extension of Pearson's Table VII showing the number of values of b», а and 
M exceeding their 5 and 1% significance levels. 


Table 4. Samples of seventy-six ; comparison of criteria 


Viens Rectangular Pearson type VII | Double exponential 

Distribution y2 = 00, у == 1-2 yi = 0-0, У. = 41 98-0 gre EO 

b. a M; b. a M b, " M, 

No. of samples: қ " М : А 
within 5 % limits 0 2 5 1 1 1 
botween Бапа 1 % limits 0 2 3 4 1 2 5 1 1 
beyond 1 % limits 10 8 4 4 6 3 4 8 8 
| 


For the rectangular and type УП distributions the criterion appears to be less sensitive 
to kurtosis than b or а; for the double exponential distribution, however, M , appears to 
be as sensitive as a and more sensitive than ba. It is not of course contended that М, is 
necessarily a practical test for kurtosis; we are concerned only to show that since b, and a 
are presumably very sensitive criteria for detecting kurtosis, M ү is much more sensitive 
even when the sample size is small than we should desire a test for homogeneity of variances 
to be. 

In this connexion it is of some interest to reconsider some data discussed by Bartlett & 
Kendall (1946). The results appear as & two-way table with fifteen columns and three rows. 
The entries in the table are the logarithms of variance estimates 52, each based on about 
48 degrees of freedom. The authors show that by taking logarithms the data are brought to 
à suitable form for the application of analysis of variance. Their analysis is as follows: 


Dd Biss M.S. 
Between rows 2 02001 0-1333 
Between columns 14 0-1047 0:0075 
Residual : 28 0-1005 0-0036 
"Theoretical variance m = 0-0020 
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The first three items are calculated in the usual way, and the theoretical реси 
based on the normal theory value x,(In s?) ~ 2/(ф— 1). Compared with this — Bh 
ance the residual mean square is significant (P=0-01). It is deduced, therefor * чари E 
is heterogeneity in the residual variance, and methods for further analysing t ra € (of 
heterogeneity are discussed. Although familiarity with the distri bution of their da | | 
which clearly a large quantity was available) no doubt rendered quite legitimate її hes 
example the assumption by these authors of approximate normality, it is perhaps W 
while emphasizing the dependence of this test on that assumption. For in general 


к(а 52) ~ 2/(dd — 1), 


. —— 1 8 
consequently a value for у, of 1:57 would have made the residual and theoretical ыы: 
exactly equal. Furthermore, since any value for the theoretical variance greater t s 1 
0-0024 would fail to show the residual variance significant at the 5 % level, any hypothes 


that the residual variance is homogeneous and Уз greater than 0-44 is not contradicted bY 
the data at this level of significance. 


8. OTHER TESTS OF VARIANCE HOMOGENEITY 


à r 
The reason for the difference in sensitivity to non-normality between the criterion M P 
testing homogeneity of means and М, for testing homogeneity of variances сап be seen 3 
follows. In the analysis of variance (F-form) the criterion for means may be written ap P bo 
where Р = (En(gy,—3)*/(k—1) is the between-groups mean square and О = s ды an 
within-groups mean square. When the null hypothesis is true, P and Q each provides * 
unbiased estimate of c? whether the Observations are drawn from normal populations от | E 
so that Q is always a standard with which P may be usefully compared. But if in Му Jk 
criterion for variances, we write 8? = о%(1 +z) and formally expand the logarithms, e 
obtain M, = $ У Ф(а)- 2)? plus terms in higher powers of 2. For sufficiently large weak 

t 
of ф, we can ignore the higher powers of x, and we have M(E— 1) plq, where 
P = {Zp (s? — 8) (6-1) and 4 = 204, 

Comparing these with the ex i 


+ re 
normality. In fact for D(s?) IH РЭ до? 


tions with finite cumulants any crit? 
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ва (уз), 88 (Узв), 26 (уә) Will be distributed like Ја, 3, (0). 52,4, (0), ... 8 а(0). Ex- 
amples of such tests are those proposed by Cochran (1941) and by Hartley (1950). Cochran's 
criterion is g = s2,,. /Xsj, whilst that proposed by Hartley is Крах = Shax./Styin.» where 
Shax. and 82,1 are the largest and smallest of the group variances 51, 53, .... 97, ..., sg. Using 
the same approximation as that adopted by Hartley it is a simple matter to calculate the 
true chance of exceeding the normal theory significance points for a D(s?) ПІ parent 
population. For instance with Ё groups of 21 observations drawn at random from such a 
Population in which у; = 1 the approximate percentage chance of exceeding the normal 


theory 5 % significance point would be: 


| k=5 | k=10 | k= 20 
| li 
M, | 17-5 | 25-2 | 37-3 
Pas: 17.3 | 23-3 | 30-3 
| 


Тће discrepancies using Fmax. are seen to be of similar magnitude to those found with M, 
ànd similar results are to be expected using Cochran’s g. ‘The multivariate tests for the 
Constaney of the variance covariance matrix from one group of observations to another 
(Wilks, 1932; Bishop, 1939; Box, 1949, 1950) would be expected to be equally dependent 
9n the assumption of multivariate normality. 

Sensitivity to non-normality is found also with sequential tests. For example, Wald 
(1947) diseussed the use of the sequential likelihood ratio test of the hypothesis H, that 
9? = o? when the alternative H; is that c? = с? and the observations 71.9%.... were drawn 
from normal populations with known mean 7. Suppose the observations were drawn, not 
from a normal universe, but from the double Х population whose distribution is given by 
equation (7). Then it will be found that the logarithm of the likelihood ratio L(ys) is equal 
to 2200) (where L(0) is the logarithm of the likelihood ratio when the parent distribution is 
normal). Consequently if the quantity 240) is calculated when 7, is not zero and referred to 
limits In (2/(1 —a)) and In ((1—A)/a), this is equivalent to referring the actual likelihood 
610) to the limits In (2/(1 — &)} and д1 {(1 — 2)/0). The actual risks of error of the two kinds 
Will therefore be а’ and / chosen so that 2'/(1— 0) = {B/(1— с) and (1 —')/®' = (а — Ayla, 
For 100а = 1000 = 5 the percentage risk of errors of each of the two kinds is given below 
for various values of Sat 


1 | 2 


7» T 


m 
| 
_ j би Бес с же 


1004 = 1002” 0-28 5-00 12-32 18-66 
| 


As with the tests with fixed sample sizes the result will be approximately true for any 
Population with finite cumulants when the average sample sizes are not small. Roughly 
es ад, В’ = B? for small values of a and р. Similar difficulties would be expected with 
Other sequential tests relating to the true value of and equality of variances; for example, 
With the tests proposed by Girshick (1946) and the tests based on ranges suggested by 
Cox (1949), 
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(8-2) A test utilizing within-group information A 

A test on variances less sensitive to parent non-normality must clearly utilize the within- 
group information in some way. Although the author believes that a better approach is 
available, one immediately practical method is to split up the groups of observations into 
sub-groups, and carry out an analysis of variance between and within groups on b» 
logarithms of the sub-group variances, following Bartlett & Kendall (1946), who have show є 
the value of the logarithmic transformation in bringing variance data to a form suitable for 
the application of analysis of variance. The question of what sizes should best be taken for 


the subgroups requires further investigation. 


Table 5. Criteria calculated for ten samples of twenty groups of twenty 
observations drawn from a rectangular population 


Test of | : Med : 
* | sta NC қ 
Source of sample ИЕ Tests of variances 
_ a 
M, on logarithms of subgroup variances 
Page Rows м, м, EO 
2 sub- 5 sub- 10 sub- 2 
groups of 10| groups of 4 | groups 9 
| К | іш оши кете 
І 1-90 14-5 6-7 96-2 10 340 
I 2140 | 22.7 8-6 18-6 n 18-1 
Тапа IT 41-50 | 138 10-8 29-6 29-3 19-8 
1-10 | ж 
II 11-30 12-6 16-7 ; ; 18:1 
m 31-50 | 196 9-9 n 2d 10-3 
III 1-20 | 269 52 17-9 10-8 11.6 
РА nm "S 21-40 12-5 16-1 40-8 99-6 19:8 
an 41–50 18-0 7-4 а › 34-1 
in aen 34-1 21-6 3 
11-30 23:8 10:3 3 19:5 
IV 31-50 11-8 10:3 de 254 29.6 
— ЕР! Eu E ET 4 — 0 E a 
Mean: Found 17-6 + 1-7 10:9 + а, 21 
+ 925129 | 30. T 29.4 4 9.8 ALA 
Expected on | 19:5 19-4 о“ E 28 $528 c 
normal theory | 221 А 
Variance: Found 29-3 + 15:6 . 7. 36 = 37 
Expected on 40-1 "4i = v4. $353 | 78-7 + 42-0 a е 
normal theory А 51:5 = 


* The groups of observations consi 
of Fisher & Yates’s tables. The page nui 


The results of a small sampling ex 
in Table 5. Ten samples of twenty gr 
of random numbers prepared by Fi 


sted in each case of t 
mbers and the nv 


s 
г ч age 
he first twenty columns of numbers from Р 
imbers of the rows aro shown above. 


& 
піў 


M. S. BARTLETT 317 


REFERENCES 


BARNARD, С. А. (1950). On the Fisher-Behrens test. Biometrika, 37, 203. 
BARTLETT, M. S. (1936). Statistical information and properties of sufficiency. Proc. Roy. Soc. A. 
154, 124. 

BanrLETT, M. S. (1952). The statistical significance of odd bits of information. Biometrika, 39, 228. 

BARTLETT, M: S. (1953). Approximate confidence intervals. Biometrika, 40, 12.* 

Совхавн, E, А. & Frsugn, R. А. (1937). Moments and cumulants in the specification of distributions. 
Rev. Inst. Int. Statist. 5, 307. 

Fisugn, В. A. (1935). The fiducial argument in statistical inference. Ann. Eugen., Lond., 6, 391. 

FISHER, В. A. (1941). The asymptotic approach to Behrens's integral, with further tables for the d test 
of significance. Ann. Eugen., Lond., 11, 141. 

Wetcu, В. L. (1947). The generalization of ‘Student’s’ problem when several different population 
variances are involved. Biometrika, 34, 28. 

Wirxs, 5. S. (1938). The large-sample distribution of the likelihood ratio for testing composite hypo- 
theses. Ann. Math. Statist. 9, 60. 


E 


* (Added in proof.) Professor J. W. Tukey has just sent me a copy of his unpublished (1949) 
Memorandum Report No. 26, Statistical Research Group, Princeton University, on ‘Standard confidence 
Points’, in which similar proposals for a single unknown parameter were put forward. 
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NON-NORMALITY AND TESTS ON VARIANCES 


By G. E. P. BOX 


Imperial Chemical Industries, Dyestuffs Division Headquarters, Blackley, Manchester, 
and University of North Carolina 


1. INTRODUCTION 


For many experimenters the most commonly used statistical tests are those for comparing 
sample means and sample variances. The test used for the equality of the means of А groups 
of observations is usually the analysis of variance test (or equivalently the / test when Ё = 2), 
whilst the test used for the equality of variances of k groups of observations is Bartlett $ 
(1937) modification of the Neyman-Pearson (1931) L test (or the equivalent F test when 
Е = 2). 

The tests mentioned are derived on a number of assumptions, in particular, that the 
observations are normally distributed. Usually, however, since little is known of the 
populations from which the samples are drawn, these tests are used, of necessity, аз if the 
assumption of normality could be ignored. So far as comparative tests оп means 219 
concerned it appears (perhaps rather surprisingly) that this practice is largely justifiable, 
for thanks to the work of Pearson (1931), Bartlett (1935), Geary (1947), Gayen (19504, b 
David & Johnson (1951 a,b) there is abundant evidence that these comparative. tests 0D 
means are remarkably insensitive to general* non-normality of the parent population. " 

Tt would appear, however, that this remarkable property of ‘robustness’ to non-normali ty 
which these tests for comparing means possess, and without which they would be much 1658 
appropriate to the needs of the experimenter, is not necessarily shared by other statistical 


tests, and in particular is not shared by the tests for equality of variances, mentioned above: 


The sensitivity to non-normality of the tests for comparing two variances was first pointe 


out by E. 5. Pearson (1931) whose findings were confirmed by Geary (1947), Finch (1950) 
and Gayen (19504). These authors showed that this test is particularly senate to chang? 
in y, from the normal theory value of zero. (The notation y,, у,, ete., will be used for E: 
standardized parent cumulants. In Karl Pearson's notation н. E УВ М = 8-3.) тан 
present paper it is shown that the Sensitivity is even greater icum fhe А. of variance? 
to be compared exceeds two, and, indeed, that the sensitivity to non-normality of the ae 
criterion or the equivalent Bartlett test can be of the same order of ma nitude ав 
sensitivity of criteria such as 5 specifically designed to test a ag 


It is further shown that the difference in itivi рал 
Sensitivity of one ^ 
i the teat on Yurtancos y of the test on means on the { 


: Get n-norm 
of the variance tests can be remedied. e sensitivity to no 


* By 'general' parent non-normality is mea; TT sicul" 
sh oe i ecg i det бп somit i pe 
an experiment in which the groups corresponded with different applied treatments to be compares 
In tests in which sample means are compared, general skewness te: a D "s : i Ер . larger effe? 
are found, however, if the skewness is in different directions in T3 у маш out; 
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2. DISTRIBUTION OF VARIANCE TEST FOR LARGE SAMPLES 


Suppose there are А groups of independently distributed observations with n, in the tth 
Sroup and Уут, = N. The ith observation of the tth group is denoted by Yu and the usual 


estimate of variance in the (th group, having ¢, = n,— 1 degrees of freedom, by 57. The average 
of the estimated variances is denoted Бу 82, where s? = X d,s?/ and Ф is the total number 
of degrees of freedom Хф, = Ф. í 

The L, criterion due to N eyman & Pearson (1931) tests the hypothesis that the 
variances in the groups are all equal, given that the observations are normally distributed. 
Alternatively, J, the logarithmic modification due to Bartlett (1937), may be employed: 


М, = 1n s?— Уфа s. (1) 
t 


Е ollowing Neyman & Pearson (1931) it may be shown that when the appropriate null 
hypothesis is true and on the assumption of parent normality, M, is distributed in large 
Samples аз Хе (where yi. ., means y? with Ё— 1 degrees of freedom). This is the basis of the 
test proposed by Bartlett (1937). He shows that, even for fairly small samples, to a close 
"Pproximation JM, is distributed as (1+ 4) А-1, where A, an adjustable constant which 
tends to zero for large values of ¢,, is given by 


1 L 1 | 
=; jaa 

> mud Фф Ф e 

It is convenient to define the quantity 

1 % E à 

ве (eigen) - i) 

When ¢ tends to infinity this becomes 
à = (1+%2)7. (4) 


Thug ô is an alternative measure of kurtosis and the quantities 2, Уз and 6 are related as 
follows: 


Ba 1 2 3 6 оо 
Va -9 -1 0 3 oo 
à со 2 1 0-4 0 


Denote by s3(Yə) an estimate of variance having ¢ degrees of freedom and based on 
9 Servations drawn from a population with finite cumulants whose measure of kurtosis 18 
Ys. Then 82(0) is an estimate from a normal population. The estimate 58 (Уз) has mean o? 
and Variance 2o1/(dé). Also from the general expressions for the higher cumulants of such 
an estimate it is seen that as ø tends to infinity the distribution of 83(Y2) tends to normality, 
“nd for sufficiently large values of ф, 83(2) is distributed like sa, (0). 
. note by M.(0; J, po; :-- Фу) the criterion calculated from normally distributed observa- 
1013 and by M (уз Уга --- Vki Фф»... Фу) the criterion calculated from # groups of obser- 
Vationg drawn from populations with measures of kurtosis Уш, Yas, ..., Vo... Yay. Then for 


ar 
| Samples М.(уз Узв :-- Vori 192 --- Фе) 
8 distributed üs Ф (X pishe (0)/} — > фи 50 (0). (5) 
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When the samples are sufficiently large we may write 8, for d,, and in the important e 
where the kurtosis in each of the sampled populations is the same, so that 6, = 6 for all t, 
My (Yo; $192 --- Px) is distributed as 


8-1(86 In s$9(0) — Y, d, In s3,,(0)} = 971 M(0; 36, dha, ..., 004). (6) 
t 


Consequently (Уз) is distributed asymptotically not as ү2 , but as (1 + 472) Ха for к. 
parent distribution having finite cumulants. We see therefore that M, is asymptotica 7 
biased if y, is not zero. In particular, in large samples the mean of the distribution curve 
М, would be (1+ 47.) (k— 1) instead of — 1 and the standard deviation (1--3y;) (2(5— и 
instead of {2(k—1)}#. The discrepancy in means relative to the standard deviation woul 
thus become larger as k was increased. Consequently divergences from normal theory 
probabilities, known to be large when Ё = 2, would become progressively larger for k> 2 
Table 1 shows the true probability of exceeding the normal-theory 5 % significance levels 1n 
large samples for various values of k and Уз when the null-hypothesis is true. 


Table 1. True percentage chance of exceeding 5 %, normal-theory significance level of Ms 
in large samples from non-normal populations for various values of Уә s 


No. of groups = k 
Уг 


2 3 5 10 20 30 


8 
4 


9 
-8 
-0 
-0 


осом 


0001 


f 
In the case when the null-hypothesis is not true it will be seen that changes in the level у 


the criterion due to non-normality and due to real differences in the group variances y 
tend to be ‘confounded’. With leptokurtic populations the criterion will tend to pe 


differences when none exist, while with platykurtic populations real differences will te? 
to be masked. 


The behaviour of the test; for me 
above. The appropriate test for the 
given that the variances are 
is the ‘within and between 
criterion used is 


ans is in sharp contrast to that for variances discuss 
hypothesis—that the means in the groups are a en 
all equal, and that the observations are normally distribut® the 
groups’ analysis of variance test. In its commonest for™ 


F = (between groups mean Square)/(within groups mean square) 
E | x "y ү, 
? sei j^ 
Alternatively, to correspond wj jth” 
of the likelihood ratio ps wi то вода employ a form obtained from the log?" 
М, = (М— 1)In {1+ (1 — 1) F/(N — 1). 


For large samples Neyman & Pear 
equivalent to M, would be distribut 


io? 
Р eri? 
Showed that assuming normality 2 erit o? 
With certain not very severe restrict PIU 
at this result is true whether the pop"l^ 


son (1931) 
ed as yz. 
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normal or not. Thus for large samples the criterion M, for means tends to follow the normal- 
theory distribution under conditions of non-normality. This explains to some extent the 
insensitivity to non-normality found by Pearson and other workers mentioned above. In 
Contrast, the criterion M, for variances follows a distribution directly dependent on y, 
however large are the samples. 

It is perhaps unfortunate that the test for homogeneity of means, and the test for homo- 
Seneity of variances in the particular case of two groups, can each be brought to the form of 
а variance ratio test. This has sometimes led to apparently contradictory statements about 
the sensitivity of the ‘variance ratio’ to non-normality. As originally pointed out by Pearson 
in 1931, the two criteria are really essentially different in character but follow the same 
distribution when the parent is normal. The marked difference in sensitivity of the criteria 
is well brought out by an example taken from Gayen’s work (1950а). A test for homogeneity 
of means of five groups of five observations leads to an F test on 4 and 20 degrees of freedom; 
9n the normal assumption this will be distributed in the same form as the F criterion used 
to compare the variances in two independent groups of 5 and 21 observations. Gayen shows, 
for example, that for a certain type of non-normal distribution in which Уі = Oand у, = 2 
the true probability of exceeding the 5% point for the test on means would be £5% 
(a quite trivial discrepancy), whilst that for the test on variances would be 10-2 96. 


3. DISTRIBUTION OF M, IN SMALL SAMPLES FOR CERTAIN 
NON-NORMAL PARENT DISTRIBUTIONS 


It will now be shown that parent distributions exist for which relations similar to (6) are 
Obtained even for small samples. 


(3:1) Population means known 


For the moment it will be assumed that the means of the Ё sampled populations are 
known, Denote by s?, an estimate of variance having n degrees of freedom caleulated from 
the n deviations from the known group mean 7. The criterion calculated from such estimates 
is denoted by М ү. If the distribution of the observations is such that 87, (y) is distributed 


ke 525, (0) for any sample size then 757, (y) js distributed like 6-1)2 о, which implies that 
(¥~n)? is distributed like д-1у2 08. This is true if 
PY) = (дрочат exp (799—026). (ооу < +00; 080). (1) 
This is a double X distribution, that is to say, it is a symmetrical distribution of disini ur 
two X distributions having 6 degrees of freedom ‘back to back’. It has mean 77, Variance о?, 
Ул = 0 and у, = 2(1—6)/6 (that is, 97 = 1+ $ys as previously defined). When 6 1 the 
term in |7— | vanishes and the distribution is normal. When 9 is less than 1, у; is greater 
than zero, and a leptokurtie distribution results. When 6 is greater than 19% й Пева Ниш 
Zero and а bimodal platykurtic distribution is obtained. For such parent distributions it is 
easily seen by the previous argument that 
М (ya; пут +++» Mp) із distributed exactly as 9M (0; ON, Ong, ..., бз), (8) 
Where Ny, Ng, ... „Ту refer to the sample sizes. 


(8-2) Population means unknown 


In 1931 Le Roux carried out a very painstaking research into the distribution of the 
Variance calculate d from small samples drawn from non-normal distributions of Pearson 
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type. He studied changes in the distribution of s? as the parent distribution changed, by 
calculating its first four moments from the moments of the parent distribution, and con- 
firmed his findings by extensive sampling experiments. One of his discoveries was thatin 
addition to the normal distribution a series of non-normal distributions existed (which he 
called D(s?) ІП distributions) for which the first four moments of s? were almost exactly 
those of a Pearson type III curve. These D(s?) ПП distributions show kurtosis accompanied 
by marked skewness. A selection of these adapted from Le Roux's table are shown below. 


D(s?) ПІ distributions 


pi 0-00 0-10 0-20 0-40 0-60 0-80 1-00 1 
уз 0:00 — 0:04 0:03 0:24 0:48 0:73 0:99 1 


+20 1:40 

125 1-51 

For such curves Le Roux deduced that to a close approximation 83(Y2) was distributed а8 
n 1 

82 (0) for any sample size (where as before d! = 1+5 35 А , à conclusion supported by 


2 


his sampling experiments. We may conclude therefore that the relation (5) is approx" 
mately true for samples of any size chosen from such populations. 

In particular, if the degree of kurtosis is the same in each of the populations sampled and 
each variance is based on the same number ¢ of degrees of freedom, then for D(s?) ш 
populations 

М (у; $, ф,...› P) is distributed approximately аз d-!M,(0; dd, dd, ...,4ф). 


Using Bartlett's method wesee that for D(s*) ITI populations therefore M (уз; Qu Boe е? $x) 
is distributed approximately not аз (1 +A) Az. , but as 


d(1-- Ad) м, (9) 


where A is the correcting factor of order ¢— defined in (2). We note in passing that departure 


from normality of this form would not need to be very great or the sample size very КА 


before the effect of the constant 4—1 completely swamped the effect of the correction term 4" 


4. Two GROUPS оғ OBSERVATIONS 


When k = 2 we may use the variance ratio F as an alternative form of the № criterion: 
Using the same argument as before and a corresponding notation it is apparent that fo! 
samples of any size drawn from double y distributions with measures of kurtosis Y21 and Уә» 


T (ys; у»; 71,7) is distributed exactly as F'(0; бул, бут), (10) 
whilst for samples of any size from D(s?) ПТ populations 
F(Y: Уза; Ф, фә) is distributed approximately as //(0; dihi d s). (М) 


These results, which are, of course, equivalent asymptotically 
for any population when the sample sizes were sufficiently Јевр, 
of how large the samples would have to be, comparison was made with the results giv? 
(11) and the more exact values calculated by other auth 


В 1 
а ae ors. Gayen (19506) and Finch ( 
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ulations defined by the first few + q „а not? 
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for various values of уз. The values of уз are assumed to be the same in each of the two 
populations. The first graph illustrates the case ¢, = 24, ф, = 60 considered by Finch, and 
the second the case фу = 4, Gy = 20 considered by Gayen. The dotted lines are the values 
given by equation (11) above. A further comparison is supplied by Pearson’s (1931) sampling 
experiments. Pearson drew 500 pairs of samples of 5 and 20 from six experimental popula- 
tions and calculated a criterion equivalent to F; this corresponds very nearly to the case 
фу = 4,ф, = 20 above (¢, is actually 19 instead of 20). In his table Pearson shows the number 
of pairs of samples for which the criterion exceeds the 4-86 % point (which again is very 
close to the 5 % point considered above). These numbers reduced to percentages are plotted 


ав circles about the second line. 


11 11 
10 10 
9 ф,=24 %,-60 
8 8 
7 7 
6 6 
РФ T 
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--- Equation (11) ж © Pearson’s sampling experiments 
a — —— —— ИЕ 
-1 0 1 
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Fig, l. Percentage probability P of exceeding normal-theory 5% level plotted against value of Ya 


The agreement between the simple approximation (11) and the values found by Finch 
is very close, as would be expected for the comparatively large sample sizes used. The extent 
of agreement with Gayen’s results is rather surprising when the small size of one of the sam- 
Ples is remembered. In both approaches it is assumed, however, that the higher cumulants 
ате finite, The marked departure from both lines which occurs when this is not so is shown 

Y Pearson’s sampling experiment from the type VII (i.e. the Student 2) distribution for 


Which Ya = 4-1 and higher cumulants are infinite. 


5. SAMPLING EXPERIMENTS WITH A D(s*) ПІ DISTRIBUTION 
Confirmation of (9) in the case k = 20 was obtained from an experiment in which samples 
Were drawn from a population nearly of type D(s?) III form. Le Roux drew 1000 samples 
h a distribution in which y? = 1-0 and 7, = 0-8. (The dis- 


.. 
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tribution was actually a Pearson type I.) He calculated the quantity (n—1)s?/n ae a 
the 1000 samples and published the resulting frequency distribution. The белер in 
grouped but the grouping interval was fairly fine. In the present work Le Roux’s = -— 
distribution of s? has been resampled, the sampling being without replacement. " se 
experiment twenty-five groups of twenty variance estimates were reconstructed E ае 
these, twenty-five values of M, were calculated. The experiment was then En e: ч 
Le Roux’s observed distribution of variance estimates for samples of = 5 from the sa 
parent distribution. 
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ИЕ бс оир, 

Table 2. Characteristics of observed distributions of twenty-five values of М, for k = 20 gr m , 
with (а) n = 20 observations and (b) п = 5 observations, drawn from a Pearson typ 
parent distribution for which y = 1-0, y, = 0-8 


| 
Expected on 3 | Expected from 
Per theory Found in | equation (9) 
| (x? approxi- | samples of 25 | (x? approxi- 
Й mation) values of M, mation) 
M, distributed as: — | 1001894, 1-415у% 
Меап 19-4 28-6 + 1-9* 26-9 
[eA | 394 93-3 + 30:31 76-1 
п = 20 БЕЗ? j 1 па ры E и 
fo 2%) 
Шын | He | gm) та 
significant 5 = | 40 -8 %) 
: 0-25 4 (16%) 3-2 (12:8 /0 
udo |) Ga 0-025 1 (4%) 0:9 (36%) 
M, distributed as: | 1-088y2, 14732 о 
{Mean 20-7 28:0+ 1.8* 98-0 
-— | Variance 44-9 79-1 + 25-64 89-4 | 
Р 
No. out of 25 | 2-5 13 (52%) 9-6 (38:2 2) 
significant 1 | и : (36 9%) 6:8 FUR %) 
tPY int | "29 89 2.8 (11-0 70 
at P %point |11 0-025 о (0 g) 0-7 (29%) 
* The quantity following the + signs is the 


the values anticipated from equation (9) 
normal theory. The excess of ‘significan 
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6. SAMPLING EXPERIMENTS WITH OTHER DISTRIBUTIONS 


Besides the population of the (5?) ІП type. Le Roux considered a variety of other popula- 
tions, mostly using sampling results used by E. S. Pearson in earlier investigations, to 
determine empirically the distribution of the sample variance. In the present investigation 
Le Roux's distributions derived from the following parent distributions have been resampled 
in the manner already described: 

Pearson type п ІП 1 Iv УП 
ҰРЫ (0-0. — 0-5) (0-5, 0-75) (1-0, 0-8) (1:2, 2-8) (0-0, 4-1) 
From each distribution, twenty-five sets of twenty samples of s? have been obtained, and 
for these twenty-five values of M calculated. These observed values of M, are set out 
in the Appendix. Table 3 shows the values of the means and variances of M, observed 
ànd those expected on normal theory, together with the number of values exceeding the 

normal theory significance points. 

It is seen that large discrepancies from the normal theory distributions occur, and the 
discrepancies are in the directions expected. The discrepancies are larger when n = 20 than 
When т = 5, and for the populations studied the asymptotic result seems to give an upper 
limit to the discrepancy which is approached as the sample size is increased. The asymptotic 
result appears to be more rapidly approached when kurtosis is accompanied by skewness 
till for the skew D(s?) III populations, the asymptotic value is attained even for small sample 
Sizes, The asymptotic result is approached more slowly when kurtosis is large and for the 
Very leptokurtie type IV and type VII distributions, sample sizes greater than twenty 
appear to be needed before the asymptotic values are closely approached. 


7. M, AS A TEST FOR NORMALITY 


The seriousness of the discrepancies to be expected, even in small samples, may be appreci- 
ated by considering JM, as a test for normality. | 

Suppose a sample of № observations is drawn from a population distributed in the double 
(7). We shall assume that the mean 7 of the population is known, so that 
Without loss of generality we may take it to be zero. We wish to test the hypothesis Но that 
the distribution is normal, i.e. that 0 = 1, с> 0, against the class of alternatives H, which 
Specify that the distribution is not normal, i.e. that ô+ 1, с> 0. Then, following through the 


(1933) for the selection of best critical regions, it is 


method set out by Neyman & Pearson à 5 
easy to show that the appropriate criterion is M , = NIn(Xy?/N)—XIny?, and that if 
M, (a) and М (1-а) are the о and 1 —« significance points for M ,, then the inequalities 


М у> М (а) and M,<M,(1—a) (12) 
defines pair of common best critical regions for testing Ay against the alternative hypotheses 
d<] (у, > 0) and 2» 1 (ys < 0), respectively, where in each case с 15 unknown. Thus in the 
Particular case when there is only one observation per group, М. „1.18 itself the most sensitive 
Criterion possible for testing for a certain type of noniine. Non-normal distributions 
Of the идола зеленой, which are defined by equation (7), would rarely be expected to 
арргохітафе to the distribution of actual data, but the above investigation suggests that 
We may form some idea of the sensitivity to non-normality of M , for small samples by seeing 


X form of equation 


how good it is as a test for non-normality. 
mulants given above differ slightly in some cases from the theoretical 
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The S ized cui ч 
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(7-1) Comparison of M_, with criteria for testing normality 
The usual test criterion for kurtosis is бо, the sample estimate of the moment coefficient 
Ё. Geary (1935а) proposed as a new test for kurtosis the criterion 


a = (sample mean deviation)/(sample standard deviation). 


This criterion had the advantage that unlike 0, the distribution when the null-hypothesis 
of normality was true could be found fairly readily. Further investigation was carried out 
by Geary (19350, 1947) to discover whether the power of this test to pick out a departure 
from normality was comparable to that of b,. Pearson (1935) studied this question by means 
of sampling experiments. One such experiment consisted of drawing ten samples each of 
Seventy-six observations from each of three symmetrical populations: 


Description 7 yt 
Rectangular 0-0 — 2:2 
Pearson type VIL 0-0 41 

0-0 2-9 


Double exponential 


* As before the actual standardized cumulants given above differ slightly from the theoretical values 
for the last two populations due to the finite size (10,000) of the population sampled. 


Pearson used his results to show the relative frequency with which the test criteria b, 
and а detected departures from normality and also to show the correlation between the 
results of the two tests. He published not only his conclusions but also the actual samples 
which he used and I have used these again to calculate the corresponding values for М 1. 
In my calculation of М, І һауе used the population mean (zero in each case), whereas in 
Pearson's calculations of б and а the sample mean was used. In samples of seventy-six, 
however, the discrepancy this introduces should not be large. A further difficulty arises 
due to the fact that the samples are grouped to the nearest unit, resulting in a frequency 
class having zero deviation from the mean which would give rise to an infinite value of M ,. 
To overcome this I have spaced the values evenly in the interval 0-0-0-5. Thus if there were 
n values in the zero frequency class these have been assumed to be at 1/(4n), 3/(4n) 
(2n — Y)/(4n). Fig. 2a shows the thirty values of 7, plotted against the corresponding 
values of b, and Fig. 20 shows the same values plotted against the values of a. It is seen that 
in each д marked correlation occurs, and if these graphs are compared with Pearson's 
Fig. 4 in which he plots а against b, it will be seen that the extent of correlation between 
М and b, and between M ,andais similar to that between а and by. So far as is known, по 
tables are available for the significance points of M, (or for the equivalent criterion L4) for 
the case of seventy-six groups with variance estimates each based on 1 degree of freedom. 
Also the asymptotic approximations of Bartlett (1937), Hartley (1940) and Box (1949) 
break down for large Ё and small numbers of degrees of freedom. Approximate significance 
Points for If , were therefore calculated by considering the distribution of Lr! = exp {M/k} 
which ranges from 1 to ©. General expressions for the moments of L, were given by 
Neyman & Pearson (1981). From these the first four moments of Гг! were calculated and 
the appropriate Pearson type curve to approximate the distribution was found to be of 
type VI. It was assumed therefore that L;* was approximately distributed as сЕ, ә, the 
curve starting at 1. By equating the first three moments of the two curves the values 
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Fig. 2a. Comparison of M у and b, in samples of seventy-six. 
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с = 2-4705, фу = 67-1998, ф = 45-2904 were obtained, and the approximate significance 
points calculated by two-way, four-point harmonic interpolation in the Р tables of Mer- 
rington & Thompson (1943). The approximate significance points thus found were: 


Upper 


Lower 


1% 


64-1 133-3 


Table 4 is an extension of Pearson’s Table VII showing the number of values of by, а and 
M „ exceeding their 5 and 1% significance levels. 


Table 4. Samples of seventy-six ; comparison of criteria 


МР Rectangular Pearson type УП | Double exponential 
Distribution y = 0-0, уз =- 1-2 у = 0-0, у = 4-1 y 00, чу, = BO 
by а Ma b, a Ma ba a м, 
Хо. of samples: б 3 " à И 
within 5 % limits 0 5 1 1 1 
between 5 and 1 % limits о 2 3 4 1 2 5 l ч 
boyond 1 % limits 10 8 4 4 6 3 4 Š 1 


For the rectangular and type VII distributions the criterion appears to be less sensitive 
to kurtosis than b, or а; for the double exponential distribution, however, M , appears to 
be as sensitive as a and more sensitive than бу. It is not of course contended that M | is 
necessarily a practical test for kurtosis ; we are concerned only to show that since 6, and a 
are presumably very sensitive criteria for detecting kurtosis, М , is much more sensitive 
even when the sample size 18 small than we should desire a test for homogeneity of variances 
to be. 

In this connexion it is of some interest to reconsider some data discussed by Bartlett & 
Kendall (1946). The results appear as a two-way table with fifteen columns and three rows. 
The entries in the table are the logarithms of variance estimates s?, each based on about 
48 degrees of freedom. The authors show that by taking logarithms the data are brought to 
à suitable form for the application of analysis of variance. Their analysis is as follows: 


DE 5,5. M.S. 
2 0-2667 0-13: 
Between rows 33 
Between columns 14 0-1047 0-0075 
Residual 28 0-1005 0-0036 
Theoretical variance — — 0-0020 
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The first three items are calculated in the usual way, and the theoretical variance is 
based on the normal theory value x,(In s?) ~ 2/(ф — 1). Compared with this theoretical vari- 
ance the residual mean square is significant (P — 0-01). It is deduced, therefore. that there 
is heterogeneity in the residual variance, and methods for further analysing this residual 
heterogeneity are discussed. Although familiarity with the distribution of their data (of 
which clearly a large quantity was available) no doubt rendered quite legitimate in this 
example the assumption by these authors of approximate normality, it is perhaps worth 
while emphasizing the dependence of this test on that assumption. For in general 


| 


kalns?) ~ 2/(d¢ — 1), 


consequently a value for y, of 1-57 would have made the residual and theoretical variances 
exactly equal. Furthermore, since any value for the theoretical variance greater than 
0-0024 would fail to show the residual variance significant at the 5 % level, any hypothesis 


that the residual variance is homogeneous and Ys greater than 0-44 is not contradicted by 
the data at this level of significance. ^ 


8. OTHER TESTS OF VARIANCE HOMOGENEITY 


The reason for the difference in sensitivity to non-normality between the criterion 2% for 
testing homogeneity of means and М, for 


testing homogeneity of variances can be seen E 
follows. In the analysis of variance (F-form) the criterion for means may be written аз P/Q, 
where P = {Уң —)%}/(Е—1) is the between-groups mean square and Q = s? is the 
within-groups mean square. When the null hypothesis is true, P and Q each provides an 
unbiased estimate of т? whether the observations are drawn from павы opulations OT not, 
so that Q is always a standard with which P may be usefully uud. But if in Mr iy 
criterion for variances, We write s? = 9*(1--2j) and formally expand the logarithms, we 
obtain M, = 1 > Ф/а,- 2) plus terms in higher powers of a, For sufficiently large value? 


of ф, we can ignore the higher powers of х, апа we have M, 


/(®Ё— 1) ~ p[q, where 
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ва (ул), Sa. (a2) 52 Sh Y'a) will be distributed like f[s5,5,(0), ва, 9,00), ....52,5,(00)). Ex- 
amples of such tests are those proposed by Cochran (1941) and by Hartley (1950). Cochran's 
criterion is g = s2,,, /У57, whilst that proposed by Hartley is Fmax. = Sinax./Smin.. Where 
Shax, and 2,1, are the largest and smallest of the group variances sj, s3, ...,s7, ..., s;- Using 
the same approximation as that adopted by Hartley it is a simple matter to calculate the 
true chance of exceeding the normal theory significance points for a D(s?)III parent 
population. For instance with Ё groups of 21 observations drawn at random from such a 
population in which y, = 1 the approximate percentage chance of exceeding the normal 


theory 5 % significance point would be: 


| k=5 k=10 k= 20 

= = — 4 
м, | 17-5 25-2 37-3 
Rm | из | 233 30-3 


The discrepancies using Fmax, are seen to be of similar magnitude to those found with M, 
and similar results are to be expected using Cochran's g. The multivariate tests for the 
constancy of the variance covariance matrix from one group of observations to another 
(Wilks, 1932; Bishop, 1939; Box, 1949, 1950) would be expected to be equally dependent 
on the assumption of multivariate normality. 

Sensitivity to non-normality is found also with sequential tests. For example, Wald 
(1947) discussed the use of the sequential likelihood ratio test of the hypothesis Н, that 
©? = о? when the alternative H, is that a? = сі and the observations y,, ys, ... were drawn 
from normal populations with known mean 7. Suppose the observations were drawn, not 
from a normal universe, but from the double y population whose distribution is given by 
equation (7). Then it will be found that the logarithm of the likelihood ratio L(y.) is equal 
to dL(0) (where L(0) is the logarithm of the likelihood ratio when the parent distribution is 
normal). Consequently if the quantity L0) is calculated when уз is not zero and referred to 
limits In (/(1—2)) and In (1— Буш), this is equivalent to referring the actual likelihood 
910) to the limits 6 In 2/(1- a)}and д1 ((1— P)[a3. The actual risks of error of the two kinds 
will therefore bea" and f’ chosen so that J'/(1 — ^) = {8/01 — a)? and (1 — | = {(1— 2)јаде. 
For 1000 = 1008 = 5 the percentage risk of errors of each of the two kinds is given below 


for various values of у»: 


Ya 


100a’ = 100)” 0-28 5-00 | 12-32 18-66 


As with the tests with fixed sample sizes the result will be approximately true for any 
Population with finite cumulants when the average sample sizes are not small. Roughly 
a! = оё, 8' = fh for small values of с and f. Similar difficulties would be expected with 
Other sequential tests relating to the true value of and equality of variances; for example, 
With the tests proposed by Girshick (1946) and the tests based on ranges suggested by 


Cox (1949). 
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(8-2) A test utilizing within-group information | 

A test on variances less sensitive to parent поп-погта у must clearly utilize the cel 
group information in some way. Although the author believes that a better approach 18 
available, one immediately practical method is to split up the groups of observations into 
sub-groups, and carry out an analysis of variance between and within groups on е 
logarithms of the sub-group variances, following Bartlett & Kendall (1946), who һа m show п 
the value of the logarithmic transformation in bringing variance data to a form suitable for 
the application of analysis of variance. The question of what sizes should best be taken for 
the subgroups requires further investigation. 


Table 5. Criteria calculated for ten samples of twenty groups of twenty 
observations drawn from a rectangular population 


Source of sample* | pod | Tests of variances 
| | M, on logarithms of subgroup variances 
Page | Rows м, м, 
2 sub- 5 sub- 10 eae 2 
| groups of 10| groups of 4 | groups 0: ^ 
Seals - u = === 
I 120 | 145 67 26-2 103 340 
I 21-40 22-7 8-6 18-6 16-1 181 
Iand II 41-50 13-8 10-8 29-6 29-3 19-3 
1-10 | 
II 11-30 | 12-6 16-7 30-7 5. 18-1 
IT 31-50 19-6 9-9 408 cie 103 
ІП 1-20 26-9 5.2 17-9 10-8 116 
HI 21-40 12-5 16-1 40-8 29-6 198 
III and IV 41-50 18-0 7-4 34-1 21.8 34-1 
1-10 ~ 
IV 11-30 23-8 10-3 37-9 7 19:5 
IV 31-50 11:8 10-3 30-6 24 29:6 
Mean: | Found | 176217 | 102412 30 ; "es 27 
+ 7+9. 93:44 9. Ate 
Expected on | 19-5 19-4 вс 6 - kun ы 213 
normal theory шы! 2i 
Variance: Found 29-3 + 15-6 | 13-94 7.9 М 7 37:9 
Expected on 40-1 39-4 ap di +353 | 787 + 42-0 а т 
normal theory 51-5 я 
ж The groups of observations consiste 
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table. In the remaining three columns are shown the values of M, for analysis of variance 
performed on the logarithms of subgroup variances. The groups of twenty observations 
were divided into subgroups in three different ways: (i) two subgroups of ten observations, 
(ii) five subgroups of four, (iii) ten subgroups of two. The means and variances of the 
Observed values together with their approximate standard errors and the values expected 
on normal theory are shown at the bottom of the table. 

With so few samples, only large discrepancies of course would be detectable. We see that 
the test on means shows no evidence of departure from the values expected on normal 
theory even though the parent population is rectangular. The Jf, test for homogeneity of 
variance on the other hand shows extremely large departures, the mean being only about 
half the value expected on normal theory (for the rectangular parent distribution the 
asymptotie mean value is two-fifths of the normal-theory value). In contrast, it is seen that 
all the tests for homogeneity of variance based on M, give values agreeing fairly well with 
What would be expected if it could be assumed that the distribution of the logarithm of the 
variance was exactly normal, an assumption far from true, particularly for subgroups with 
only two observations. 


9. DISCUSSION 


It has frequently been suggested that a test of homogeneity of variances should be applied 
before making an analysis of variance test for homogeneity of means in which homogeneity 
of variance is assumed. The present research suggests than when, as is usual, little is known 
of the parent distribution, this practice may well lead to more wrong conclusions than if 
the preliminary test was omitted. It has been shown (Welch, 1937; David & Johnson, 
19515; Box, 1952; and Horsnell, 1953) that in the commonly occurring case in which the 
group sizes are equal, or not very different, the analysis of variance test is affected sur- 
Prisingly little by variance inequalities. Since this test is also known to be very insensitive 
to non-normality it would be best to accept the fact that it can be used safely under most 
Practical conditions. To make the preliminary test on variances is rather like putting to sea 
ina rowing boat to find out whether conditions are sufficiently calm for an ocean liner to 
leave port! s А 

When the groups of observations were of unequal size and differences in variances might 
Occur it would seem logical to replace the usual analysis of variance criterion which uses 
а pooled estimate of within-groups variance by the alternative criterion proposed by Welch 
(1951) and by James (1951), i.e. Хи), where w, = m/s}. This criterion is robust to 


inequality of variance and almost certainly to non-normality bed (In fact, where inequali ty 
of variance might occur it would seem most logical to use Welch’s criterion even if the groups 
Were equal.) | - | 

When a criterion for testing a statistical hypcthesis is derived (for example, by the 
likelihood ratio method), it is usually necessary for purposes of mathematical convenience 
to Over-simplify the specification of the problem. We should not be surprised therefore if 
an examination of the resulting criterion shows that the assumptions have sometimes, so 
to Speak, been interpreted rather too literally. For this reason it is most important that 
derived criteria should be studied for robustness. | | | 

The property of robustness I believe to be even more important in practice than that the 
test should have maximum power and that the statistics employed should be fully efficient. 
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* 
џ "mer. 
Where necessary I believe that the latter qualities should be sacrificed to ensure the for € 
H = m e 
On the other hand, I do not think that we need necessarily go to the extreme of using 


"= 3 n be 
parametric tests when it may well be that more powerful robust parametric tests са 
found. 


А ; valuable 
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D B 


2 5 аа rish to 
suggestions for the improvement of the presentation of this paper. In conclusion, I wis 
thank Mrs Margaret Edmondson for valuble assistance with the computations. 
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APPROXIMATING TO THE DISTRIBUTIONS OF MEASURES 
OF DISPERSION BY A POWER OF y? 


By J. H. CADWELL, Ordnance Board 


1. INTRODUCTION 


The use of range in place of sample variance in various statistical tests has been the subject 
of a number of recent papers. In order to make range and means of sets of ranges amenable 
to certain lines of attack, two approximations have been proposed for the case of a normal 
parent population. The first consists in replacing range bya multiple of X with an appropriate 
number of degrees of freedom. The two parameters at choice allow the mean and variance 
of range to be matched by those of the approximating form. This method has been discussed 
by Patnaik (1950), Hartley (1950) and Florin (1950); David (1951) gives tables of the 
necessary constants. The second approach, due to Cox (1949), replaces range by а X? form. 
Pearson (1952) has compared the errors arising from the two methods. He points out that 
either can be regarded as an approximate transformation to the X? form, and thus a number 
of standard tests are available. For instance, Bartlett/s test for homogeneity of variance ог 
the short-cut tables prepared by David (1952) may be used. Similarly, 
ranges can be replaced by an approximate F-ratio. 

Below we graduate the distribution of range by a multiple of a suitable power of №. 
The extra parameter allows the first three moments to be matched, and much smaller errors 
result. By inverting the relationship an approximate transformation to the X? form results. 
It should be noted that, while the X approximation is dimensionally correct, the others are 


not. Thus the increased accuracy is obtained at the price of some loss of applicability of 
the approximation. 


the ratio of a pair of 


The method gives good results up to sample size 20 and the necessary constants are 
tabled. Results for the mean of a set of m ranges are also satisfactory and constants are given 
for m from 2 to 5 and п from 2 to 10, As there are no probability integral tables available for 


this statistic, these constants may be used to supply approximate percentage points from 


those of 2. 

As the power of the statistic needed de 
of an F-ratio test is restricted to v 
5 and 1 % short-cut tables c 


In a paper to be published shortly, moment co 
This statistic is defined as the difference betw 
of the ordered values in the sam 


asi-range are given- 
atest-but-one and the least-but-0n* 
beyond 17 it is more efficient tha? 
€ss influenced by the presence ofan 
he normal model. The same method 
ts is given. 

and transformation constants and 5 


ation. It is also ] 
В ‘ 2 1 
occasional ‘rogue observation, or by departures from t; 


of graduation gives good results and à table of constan 
Mean deviation can be graduated in the same way 
and 1 % short-cut tables are provided. | 


ы 


~ 
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Wilson & Hilferty (1931) have shown that, for large degrees of freedom, the cube root of 
X? is almost normal. This leads to an approximate normalization of statistics that can be 
graduated by a power of д. An example is given in the section on range. 

Another transformation of some interest is the use of the logarithm of ү?. This statistic 
has a variance independent of the parent value of the standard deviation, and can be used 
ifitisdesired to apply analysis of variance techniques toa set of observed standard deviations. 
However, while the statistic approaches normality in large samples, it does so rather slowly. 
Bartlett & Kendall (1946) suggest that the method should be used with caution for sample 
sizes below 10. We see that the logarithmic transformation when applied to range, or any 
of the other statistics considered below, will have the same properties. The variance of 
In w will still be independent of the parent standard deviation and will be approximately 
Са In no] .. 


The errors introduced by the approximation to w will be small in comparison with the 
departures from normality of In y?. Thus conditions for the use of the logarithmic trans- 
formation of range or mean deviation will be similar to those for its use with standard 


o? 


deviation. 
It seems likely that other applications of this method of graduation may occur. Thus non- 


central y? can be well approximated by a central д? on a suitably chosen number of degrees 
of freedom. Still betterresults could be obtained by the present method. The use of frequency 


Curves of the form а-+(х?)= (x2 оп v df.) 
might also repay investigation. 
2, DETERMINATION OF CONSTANTS 
oe : 
Consider the quantity a = (4x2), 


where X? is on v degrees of freedom. The moments of v are readily found; they are given by 
А о 


the following relation: КЖ) М 
H7 сту) 
Using Stirling’s series, the expansions below follow at once: 
2 —1)? , (x—1)*(5z— 1) (x — 5) , 
-te-a 10) [i-o ) +! С 240% i | (2) 
1 
Уй _ Mills 3_1„ 34-19, —— | 
V Из a v 
On solving (2) and (3) for v we obtain 
2 | pr peT., ii 
” = GY-JAY ову = 4) 
Inversion of (2) gives the series 
2 
_ су{%—1)# 20090 = 1)#(18 + 220 +1) | ма m 


с = 6 Әр 24у? 


у 
Where а = JG) V. 
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Ifthe values of V and £, for a particular statistic и be inserted in (4) and (5), we shall have 


the results u is approximately distributed like (y?/c)*, (6) 


cu^ is approximately distributed like д. (7) 
The constant c is chosen to make the means of the two variates in (6) agree. We have 


a= : and loge = log2+A{log Г(а + $v) – log Г(3») – log uj. (8) 
The expression for loge is readily evaluated using the log factorial tables prepared by 
Brownlee (1923). 

For small v or large о, (4) and (5) will not be sufficiently accurate. The following procedure 
was found to be satisfactory for range, quasi-range and mean deviation. 

Since (5) comes directly from (2), the use of an approximate value for у, while affecting 
the f, match of u and x does not affect the matching of mean and variance. Thus, for v > 10 
it is found sufficient to round to the nearest integer and substitute this value in the first two 
terms of (5) to determine о. 

When 10 и> 4 it is better to take v to the nearest 0-1, and three terms of (5) are needed. 
In practice it was found preferable to solve 


72+1 = Teip · (9) 


Thus, for mean deviation with n = 5, two terms of (5) give ж = 0:554. Using (9) twice 
тейле а = 0-55, V? = 0-13655, 
а = 0:56, V?=0-14131,. 


Inverse linear interpolation for the required value of V2 (0-13929) gives æ = 0:5558. The 


use of trial values of æ with a spacing of 0-01 obviates interpolation in Brownlee’s tables 
and gives a maximum error in а of a few units in the fifth place. 


When у <4 the series (4) still gives very good results. Thus for range with n = 3 it gives 
2:08, the true value being 2-05. However, it is now advisable to retain the second decimal 
in у. Thus it becomes necessary to solve (9) and the relation 


f, V34-3y24.1 = 782 + (Tbe (10) 
Qe c 
This is readily done using trial values of v rounded to 0-02 i і 
A *02 to ob i i the 
log factorial tables. Thus, for range with n = 4, (4) gives у = ie ici ipie десе А Jf, 
at each of the four points 9:23. The values of V? an 


у = 3:20, a = 0-52, У = 3:24, а = 0-52 

у = 3:20, а = 0-53, У = 3:24. a= и 

are then found using (9) and (10). Double inverse lineari ke | 
от found wing ја ear interpolation for the required value 


v = 3:202, a= 0-5957. 
With an interval of 0-04 in trial values of у, ne 


1 ; a 
slightly in the fifth decimal place in the most u glect of second differences only affects 


nfavourable сазе. 
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It might be desirable to use a fixed value of A for different sample sizes. This value could 
о 
be chosen as a compromise between the values given for the various sample sizes in Table 1. 
The values of у for each sample size will be found from 


i vo (a 1)? о аон (11) 


20; 4 1 
where у= ту and 2-3. 


When « = } this reduces to the series developed by Florin (1950) for the Х approximation. 
Two decimals in > are usually adequate, and for vy < 4 we solve (9) for v using trial values 
Suggested by (11). 

It sometimes happens that all but one or two samples are of the same size; the A appro- 
priate to this size may then be used throughout. For the exceptional samples > and с are 
found from (11) and (8); for the standard size the values of Table 1 are used.* 


3. RANGE 


In order to apply an approximate test of homogeneity to a set of ranges, each for the same 
Sample size, we first transform to approximate x? values using the constants of Table 1. 
Then the usual Bartlett test is applied to these values. The adequacy of the approximate 
transformation to the x? form was investigated as follows. 

Tabular values of и, as near as possible to the percentage points listed below, were trans- 
formed by (7). The approximate probability integral was then found from tables of the 
x: integral. This was compared with the exact value in tables of the probability integral 
of range prepared by Pearson & Hartley ( 1942). Errors in the approximate probability 
integral, expressed in units of the fourth decimal, are tabled below: 


Y%point| q.s 5 20 50 80 95 99-5 
n o 2 
= +2 +3 —2 == =] 
У zi = +5 +5 -3 -2 0 
10 -3 -3 +5 t4 -4 =l 0 
15 ШЕ. 0 +7 0 -3 0 +1 
20 m +1 +5 -2 -3 +1 


David's (1952) 5 and 1% short-cut tables for the maximum F-ratio, transformed by (7), 
Zave Tables 3 and 4. Exact values of у, not the rounded values of Table 1, were taken. 
A four-point Lagrange formula was used when interpolating for ›, апа harmonic values 
ОЁ у were employed when it exceeded 10. For small v the use of variance ratio became im- 
Possible, but the ratio Of Smin. 60 Smax. WAS quite satisfactory for interpolation. The table 
for the 1 % values was not taken ice * E T 2 i ші and rounding errors 

i ^ iliti recia eyond this value. 
ы: P ик сүрүүсү" аа Tf, in a set of k ranges, M denotes the ratio 


9f maximum range to minimum range, we 
=1-k bé FM) =F (æ) f (2) ах. (12) 
Pr. (М>»М,) = 1 К (М, 


ave been placed together at the end of the paper. 


have 


* All numbered Tables h 
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Using values of M in Tables 3 and 4, the following probabilities were found by quadrature: 


% point k=2 k=6 k=11 

| 
п= 5 | 0-0506 0-0514 0-0526 
5% n=10 | 0-0508 0-0504 0-0516 
n= 20 0-0515 0-0516 0-0496 
1% n=5 0-0104 0-0106 0-0107 
n=10 0-0103 0-0102 0-0094 


"These values show a slight bias, but are satisfactory for most purposes. In view of the size 
of the error at n = 5, k = 11, the true probability was also found at the nominal 5 % level 
forn = 4, k = 11; it was 0-0513. While the effect on probability of a change of + 0:01 in the 
ratio varies over the tables, average values are — 0-0004 for the 5 % tables and — 0:0002 
for the 1 % table. 


We now turn to another application of the method. 
uses the fact that, for large v, 


[2 ERR, CRM 
4(9) =] pt 0 ), var. > =y Pow 3). 


In addition, the distribution of this quantity is ne 
we find 


The Wilson-Hilferty approximation 


arly normal. Applying this fact to range 
в approximately Nr. {1-6443, 0-3844},* 
is approximately Nr. {1-7163, 0-2197}. 


The errors in the approximate probability integral, found from the normal integral, аге 
shown below in units of the fourth decimal: 


[7 9088 i 


0-4881 
wio 


% point 0-5 5 20 50 80 95 99-5 
п=5 +20 +19 -15 -16 -10 =i 
| n=10 +1 -1 | -8 = 1 +2 % -1 


While the approximation is good at n = 10, there are large errors when їз as small as 5. 


4. MEAN RANGE 


splitting the sample 
as an estimator of standard 
um method of splitting the 
› Will often arise naturally № 


dispersion, but may take different mean levels in the various s 


d to transform the me 
an 
of т, to the x? form by means of (7). Except in one case disou 


is available for checking this approximation direc 
tly. 
was used. y Conseq 


amples. 

f m ranges, each for a sample 

ussed later, no simple metho 

uently the following procedure 

* Nr.íp, 0} is used to denote a normal distribution. having mean =, and standard deviation 9” 
EE. ы 5 ат еуі. 
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While the two variates in (6) agree in their first three moments, ће 2, values will differ. 
Using (1) the following values were found for this f difference. The 0, of u was the larger 
of the two throughout: 


n т-і тж? т= 3 m=4 
2 0 —0-013 — 0-012 — 0-008 
3 | — 0:005 — 0-004 — 0-002 — 0-002 
4 | — 0-010 — 0-005 — 0-003 — 0-002 
10 | — 0:015 — 0-009 — 0-006 — 0-004 


The percentage point tables prepared by Pearson & Merrington (1951) show that, for the 
region of the (f,,) plane concerned, a given Дь error has a greater effect for large values 
of 8, than for small: its effect varies but little with #%„. For the largest value of £, in the above 
table (n = 2, т = 1) the transformation is exact. The next largest, at m = т = 2, also shows 
the second largest J, difference. 

The probability density function for a single range is of simple form when n = 2, and we 
find by integration that the mean of two such ranges has the density: 


о w офа? 
"- ез| йа, 
\/\(?л) o у(2л) 
The probability integral of 7 was found by quadrature, and the previous method of checking 
used to give the following errors in units of the fourth decimal: 


| 
20 | 50 | 80 | 95 | 99-5 


% point | 0 


E 


кз 


Error | == | -5 | +4 | +11 | елі | =й | ИК 


Since (A, В») approaches the normal point аз m increases, this test, together with those made 


orm = 1 in the previous section, may be assumed to cover the least favourable cases, 


5. THE FIRST QUASI-RANGE 


Іп Samples too large for the direct use of range, it may be necessary to randomize the 
Observations before splitting them into groups. For instance, we may suspect that values 
are not independent of the order in which they are recorded. The lack of uniqueness may also 
ев drawback to the use of average range in some circumstances. 

п such cases the first quasi-range, defined as the difference between the greatest-but-one 
ӛлі the Jeast-but-one of the ordered values in the sample, may prove useful. Using the 
ange given by Tippett (1925) and reproduced in Tables for 


ta йе 3 
S of ex values of 1 4 са 
of expected Vol. 2, the mean value of this statistic can be found from 


‘atisticians and Biometricians. 


\ 


2@(х„_) = 2n(n— ». FQ -Py-dF 


(03—23) 
2n(n— f (1 — Fy-2dF -2n(n— vf (Рае 


п (Wy) — (n— 6 (и). 
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Using the first three moments of quasi-range* the constants of ible 1 were found. la 8 
enable approximate percentage points to be deduced from those of X by using (6). a 
in the approximate probability integral of quasi-range found from (7) are given below 
units of the fourth decimal: 


- | 0-5 5 20 50 80 95 99-5 
^" 5... — ыы 
10 —1 => +1 +4 0 -1 0 
20 => 0 +6 +2 = es] +1 
30 -2 | +1 +4 -2 -2 -1 +1 


In order to avoid double interpolation in the Hartley-Pearson (1950) tables of хе, values 
of v have been rounded to the nearest even integer when above 30. 


6. MEAN DEVIATION 


Work with this statistic was on the same lines as for range. Errors in the probability 
integral appear below; these are expressed in units of the fourth decimal: 


mm 1 М 
le nd 0-5 5 | 0 50 80 95 | 99-5 


e: 


3 =i -2 +2 +3 -2 -2 | -1 
4 ED -5 +7 +10 | =r E Tl 
5 -2 c4 | ee | же | 6 = +1 
6 -2 = +5 + 3 -3 0 +l 
| 10 =1 -1 +1 +3 | 0 


As no tables are available beyond sample size 10, £, differences in (6) were again examined. 
Series expansions for f, and f, due to Geary, are given by Pearson in an Editorial Note 

to a paper by Godwin & Hartley (1945). Godwin (1948) gives a minor correction to the /% 

series. Fisher (1920) derived the exact distribution of mean deviation when n = 4, 22 


gave the values f, = 0-297, В = the values 0-299 and 3-252 respe% 
tively. 


An examination of Figher’s 
deviation estimator of с) given 


3:28. The series gives 


paper shows that the fourth moment of c, (the mea? 
there should be replaced by 

1657 
3849: (%=cos-1}), 
Using this value we obtain f, = 0-2983, В» = 
series. 


Эт 
515 (57 — Та) + 


3 
: 8 
3:252, in very good agreement with Geary 


The values of Ё, differences found are shown below: 
ý У ia 5 6 ю | 15 m 
ТІ | | (ді m — 
A, difference | —0-005 | ое | 00 017 | -ooa 0-008 | 0004 | —0:008 
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The 1 “се 1 t О: 
arge у: Sé H Қ і і 
inte T L alues at 4 and 5 are reflected in the err rs found in the approximate pr obabi ity 
‘al. As mean deviation moves steadily tow T i with increase of n ia ea 
gra \ owards normal i i 1 
: 8 па. ity wit Қ 
<pect errors be yond 10 to be small. EM ü : | 
Tables [: ) giv e i per 2 уа, 1aximum 
5 апа 6 give the approximate upper and 1 % values of the ratio of m i 
eviation to minimum mean viati i fs 
deviation in a of k val Usi 2 
mean d tio | set of Буа ues. sing ( 2 ) the fo i g 
2 > own, 


vi ld 
alues of probability were found from these tables by quadrature: 


% point | k=2 k=6 be dd 
B | 
5% п-4 0-0510 0-0520 ores; | 
n= 10 0-0506 0-0506 | 005009 | 
(ees Jo | 
1% n=4 | 0-0104 0:0108 | 0011 
n=10 | 0-0100 0-0101 00099 | 
HET ‘a | 


Tt wi 
5% zw be seen that errors are similar to those for range. For sample sizes below 10 it will 
ound that the critical ratio differs very little for range, mean deviation or edicto, d 
indar 


deviation. 
The use of rounded values of v in Table 1 obscures the fact that A tends to a limit as 
a n 


ы this can readily be proved by using Geary’s series. This suggests that, with a 

Gotan Oss of accuracy, а fixed value of A could be employed. It turns out that 1-8 is a satis- 

ie y value, and appropriate tables are being prepared. The use of a fixed A greatly in- 
ases the flexibility of the method by removing the restriction to a fixed sample 4 


раа like to thank Mr D. F. Mills for his assistance in the preparation of the tables 

mak indebted to Prof. Pearson for his advice on the presentation of this work, and ғ 
А hg an advance сору of the short-cut tables of critical variance ratios available, 

cknowledgement is made to the Chief Scientist, Ministry of Supply, for permission to 


Publish this paper. 
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Table 1. Transformation constants for range, mean deviation and first quasi-range 


| эмы 
Range | Mean deviation | First quasi-range 
n | т 
v À log с v А log с v А log с 
10 | 80 | 17280 | 0-3590 
11 | 93 | 17144 | 03880 
2| 100 | 2 T6990 100 | 2 0-3010 | 12 | 11 16683 | ОИ 
3 | 205 | 19619 | T7632 | 2-05 | 1-9619 | 0-5766 | 13 | 12 | 1.6810 т 
4 | 320 | 19029 | Т.8453 | 3:35 | 1-8437 | 0-7630 | 14 | 14 16271 | 051 
5 | 45 | 18298 | 1.9393 | 46 | 157992 | 0.8851 | 15 | 16 1-5845 (404% 
6 | 60 | 1-7489 | 0-0412 59 | 1-7655 | 0-9823 | 16 | 18 | 1.5497 бш 
T| T7 | 16700 | 014276 | 7-2 | 157497 | 1-0612 | 17 | 20 | 15202 фар 
8 | 95 | 16046 | 09315 | 84 | 1-7394 | 11235 | 18 | 99 1-4950 | 0:6905 
9 | 12 155018 | 0:3543 | 9-6 | 1-7355 | 141778 | 19 | 24 14730 | 0:728 
10 | 14 14643 | 04218 | 11 1-7164 | 1-2326 | 20 | 27 | 14257 | 07857 
‘11 | 16 1-4292 | 0-4794 | 12 1-7310 | 1-2695 | 21 | 29 1.4102 | 08125 
12 | 19 13605 | 0-5748 | 13 17428 | 1-3036 | 22 | 32 1.3738 | 0862 
13 | 22 1.3075 | 0-6537 | 15 1:6918 | 1.3596 | 23 | 36 1.3236 | 0:9290 
14 | 26 1-2392 | 0-7521 | 16 1:7036 | 1-3874 | 24 | 38 1-3153 | 0.9492 
15 | 30 11860 | 0:8338 | 17 | 1-7144 | 1-4136 | 25 | 42 | 10755 | 1:0036 
16 | 34 11427 | 0-9036 | 18 17235 | 1.4383 | 26 | 46 | 12413 | 1:0530 
17 | 40 L0783 | 1-0030 | 20 1.6872 | 1-4799 | 27 | 50 | 1.2115 | 10977 
18 | 46 10276 | 1-0858 | 21 1.6906 | 1-5012 | 28 | 54 | 1.1853 | 11384 
19 | 52 09865 | 11589 | 22 | 17053 | 1-5215 | 29 | s8 | 11618 115 
20 | 60 09558 | 12431 | 23 | 1-7129 | 1.5409 | 30 | 62 11409 | 1-210 


If the statistic is и, then we have: 


cu^ has approximately the x? distribution on у degrees of freedom. 
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Table 2. Tra nsformation constants for the mean of m ranges, each for a sample of size т 


m=2 т==8 
n 
у А | log c v À logc 
| то - ---- дет 
| 2 2.22 1-8376 0164 3-46 1-7822 048918 
3 44 1:8539 0-1749 6-7 1-8282 0-3791 
4 6-7 1-8339 0-2217 10 1-8302 0-4064 
5 9-3 1-7822 0-2948 14 1.7740 0-4823 
| 6 12 1.7385 | 0-3617 18 1-7337 0-5449 
i 7 15 1-6849 | 0-4364 23 1-6633 0-6355 
М 8 19 1:5995 0-5434 29 1-5838 0-7372 
| 9 23 1:5375 0-6283 34 1-5477 0-7952 
10 28 14620 | 0:7285 42 1-4613 0-9066 
т=4 m=5 
n 
v À log c v А log c 
| 
2 +7 1-7562 0-5408 6-0 1-7310 0-6572 
3 | 90 1-8152 0-5178 11 1-8339 0-6044 
4 14 ШЕШЕНІ 0-5739 17 1-8054 0-6529 
5 19 1-7547 0-6257 24 1-7446 0-7330 
6 24 1.7319 0:6731 30 1-7310 0:7719 
Y 30 1-6812 0-7449 38 1-6683 0-8544 
8 38 1:5959 0-8503 48 | 15870 0-9567 
9 46 1:5354 0-9336 58 | 15288 1-0381 
10 56 1:4605 1-0327 70 1-4607 1-1300 
Tf t} istic i haves 
а кырын не distribution on v degrees of freedom. 


cud has approximately the № 


Та А ‘ io of maximum value to minimum value % 
i Мез. д pproximate upper 5 % points of t he ratio of : i value in 
a set of k independent ranges. Each range is for a sample ој size n from a normal population 


ој fixed standard deviation 


NE ғ“ 
ЛЕР и 4 5 | • 7 8 9 10 11 12 
n Ж ып Жин жиш жие : 5 
35. о |142 |163 |182 |200 |2L7 |292 |248 | 26-3 
4 dee m 118 | лао | 795 | $63 | 934 | 976 |103 |107 |111 
5 419 HET 4.63 5-10 5:53 5:93 6-26 6-55 6-80 7-05 7-98 
в | a» | 416 | 447 | 471 | 498 | 514 | 532 | 5-50 | 5-66 
7 P 3:37 HER 3.61 | 385 | 404 | 422 | 437 | 451 | 463 | 4-75 
&| 25 | 25] Бог | or | 249] 202] S8 У | те | eee | а 
о | 232 | 275 | 305 | 303 | 310 | 332 | 344 | 355 | 364 | 373 | зві 
i| $e ТЕТЕ Ч ы | Sd | 303 
19 en | 272 | 282 | 290 | 298 | 3:05 | 3:11 | 3-16 
ig | 197 | 226 | 246 | 290 | 246 | 254 | 261 | 267 | 273 | 278 | 2-83 
20 A ер TM pee 214 | 221 | 228 | 234 | 238 | 242 46 | 2:50 


346 Approximating to distributions by a power of у? 


Table 4. Approximate upper 1% points for the range ratio 


\F | 2 3 4 5 6 7 8 9 10 u 12 
т 
denn | 
3 |141 |210 |267 /|3L8 |364 |407 |448 |485 |591 |554 | 58-7 
4 | 691 | 928 |110 |124 |137 |148 |158 |167 |176 |185 1191 
5 | 487 | 616 | 706 | 7-76 | 844 | 9-04 | 951 | 9-93 | 10-5 | 10-8 |112 
6 | 3-96 | 480 | 540 | 586 | 630 | 666 | 699 | 735 | 7.51 7.716 | 7:96 
7 | 944 | 408 | 452 | 486 | 516 | 543 | 566 | 586 | 606 | 626 | 646 
8 | $11 | 363 | 398 | 427 | 450 | 471 | 488 | 505 | 518 | 531 | 544 
9 | 288 | 338 | 3-62 | 3-86 | 405 | 420 | 4-34 | 447 | 460 | 4.711 | 481 
10 | 271 | 3:10 | 835 | 356 | 3-73 | 3:87 | 400 | 410 | 4-90 | 4.99 | 438 


Table 5. Approximate upper 5 9/, points Jor the ratio of maximum mean deviation to minimum 
mean deviation in a set of k independent values. Bach val 


ue is derived from a sample of 
n from a normal population of fixed standard deviation 


WO в 3 4 5 6 7 2 
vi 8 9 10 11 1 
pee 
3 | 628 | 932 |1L9 |142 |163 |182 |200 | 94.7 : оң. 26-8 
4 | 3:96 | 5-32 | 6:31 7.20 | 795 | $64 | 9.22 | 974 d E 11:2 
5 | 317 | 404 | 464 | 513 | 555 | 5-06 | 6.30 6:59 | 685 7.09 | 7-81 
6 | 2-74 3:37 | 381 | 416 | 447 | 4-70 | 493 T 5-65 
| #48 | a8 | 588 se | Sen 4o | зло | &x | 58 | се | РА 
8 | 230 | 272 | 901 | 323 | 342 | 357 | зло | 324 3-95 | 404 | 413 
9 | 217 | 254 | 278 | 297 | 313 | 326 | 337 | 284 3-57 65 | 373 
МУ | 1 | 528 | SO | Ser | ax | Beg | 512 | за. 3:29 3.36 3-49 
12 | 192 | 219 | 236 | 250 | 260 | 970 | 2 ; 
15 | 1-77 | 199 | 213 | 224 | 232 | озу bud ~ 291 | 2-96 ET 
20 | 1-63 | 1-80 | 191 | 199 | 205 | 210 | 215 in |581 340 | 22 
30 | 149 | 1-61 | 168 | 174 | 178 | Lee | 145 iu | 222 | 226 | T 
90 | 181 | 188 | 148 | 148 | ds | зво | тд 14 | LI | 198 | Те 
* | 100 | 109 | 100 | 100 | 1:00 | 10 | тор ^ n: s 1.00 
Table 6. Approximate upper | ; 
pper 17% points for the mean deviation ratio 
К 2 3 4 5 6 7 E 
E 8 9 10 11 12 
3 |161 |210 |967 |318 | 36.4 40-7 = m 
4 692 | 9:30 | 11-0 12-3 13-6 148 448 |485 52-1 55-4 58:7 
а В 3. p^ 168 |176 |184 |199 
6 896 479 541 | 586 | 630 | ggg 100 [105 |109 |H 
7 5 -07 4-51 4-85 5. ` 6-98 7. 7:96 
8 | 308 |340 | 395 | 422 | 515 eal | оба | nas eos | 225 | а: 
ІШІК | 255 | зао | Зов | $97 | ав | Fee | 696 | g 540 
10 2-65 3-02 3-27 415 А 5-15 5:28 72 
8-47 3-64 377 5 29 | 4-42 4-53 4-63 4 26 
89 | $99 | 409 | 448 | 4 
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THE POWER FUNCTION OF SOME TESTS BASED ON RANGE 
By H. A. DAVID 


1. INTRODUCTION 


Tn a number of recent papers (Patnaik, 1950; Hartley, 19504; David, 1951) tests based on 
the sample range have been developed to deal with the analysis of simple and double 
Classifications as well as of some more complex orthogonal designs. It is our purpose to 
Compare the power function of these short-cut tests with that of the analysis of variance 
Procedure usually employed. 

The new test criteria which may be used in place of the F-ratio are of two forms, depending 
Оп whether the test is one for a main factor or an interaction. In the former сазе the test 
function is the ra tio of a single range to an independent mean range estimator of the popula- 
tion standard deviation с, all ranges being taken over a small number of variates. In this 
Situation a mean range can be closely approximated by a y-distribution (with fractional 
degrees of freedom v) so that the test ratio may be referred to tables of the ‘Studentized’ 
Tange 4. The same y-approximation allows us also to deal with the ratio of two independent 
Mean ranges which arises in a test for an interaction. 

The power of these tests has not been determined for any of the above designs, except in 
Special cases, However, there has been evidence suggesting that the tests are likely to be 
reasonably powerful. Thus: (a) It is known that for a small sample size n the range provides 
а fairly efficient estimator of с and that for large n (т > 12) a very good estimator of c is 
given by the mean of ranges taken over small subgroups of the original sample (Davies & 
Pearson, 1934: Grubbs & Weaver, 1947). (b) For all of the above classifications tables are 
available giving the ‘equivalent degrees of freedom v of estimates of error found from mean 
ranges (see, for example, David, 1951, pp- 408-9). vis generally not much smaller than the 
number of error degrees of freedom in the analysis of variance. (c) Extensive investigations 
by quadrature methods made by Lord (1950) show the power of his w-test (Lord, 1947), 
mploying a range estimate of variance, to be little less (on the systematic model) than that 
of the corresponding t-test, especially when mean ranges are employed. 

hese considerations suggest that the use of a mean range instead of a root mean square 
аза 'Studentizing statistic leads to a definite but small loss in power. This will be confirmed 
In the sequel. E will also be shown that if a random probability model is relevant, range 
Methods are somewhat less powerful than the standard procedure. For the systematic 


Case attention is confined to a number of special alternate hypotheses. The g-test, whilst 
always retaining the property of unbiasedness (in the Neyman-Pearson sense) of the F-test, 
15 found in certain rather special but important practical situations to be more powerful 
than the F-test. 


2, THE RANDOM MODEL 


We Consider first the random set-up 

ty = K+ titz @ = 1,...,Ь;] = L...,m), (1) 
2; are all mutually independent normal deviates with 
Ü 


Where Ki у: 
nd the %;, 
18 a constant an E f the 4-terms in (1) makes the variance of 


Vari 
Tiances g-/2 (for u) and о? (for z). The presence о 
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the means z; equal to (02 + mo"?)[m instead of o?/m: hence the usual ratio of the prem 
groups mean square to the within-groups mean square is distributed not as F bu is 
(1 -- те) F, where б = o’/c, the degrees of freedom of F being v, = /—1, v, = ((m— 1). т 
this case the single-tailed F-test is known to be the uniformly most powerful test of the nu 
hypothesis that ¢ = 0 against alternatives for which б> 0. | | 
When range methods are employed, we test the null hypothesis by referring the ratio 


_ Am range (2,) (2) 
и — wje 
to tables of percentage points of the ‘Studentized’ range q (Pearson & Hartley, 1943; 
May, 1952; Hartley, 1953). Since the presence of the w-terms multiplies the standard дет 
tion of mz; by the factor \/(1 4-m£?), q, will be distributed approximately as /(1 +m?) d: 
where 4 is the ‘Studentized’ range for sample size / and degrees of freedom v. 


Table 1. Power of the q-test for a simple classification (random model); 
power of corresponding F-test = 1 — f 


la. а= 0:05, 1-й = 0-90 lb. «= 0-01, 1—2 = 0-90 
m aum іре? "Wr ee 
m 3 6 © d 3 6 © 
1 17%. 
4 0-90 0:90 0-90 4 0-89 0-89 0-89 
6 0-86 0-88 0-89 6 0-87 0-88 0-88 
8 0-85 0-87 0-87 8 0-85 0-86 0-87 
10 0-83 0-86 0-86 10 0-83 0-85 0-85 | 
aa Ее. 
lc. а = 0:05, 1— 8 = 0-75 1d. & = 0-01, 1-f = 0-75 
m “т 
1 3 6 со | D 3 6 ка 
= — — ~ m 
4 0-74 0-74 0-74 4 | 07 .74 
6 0-68 0-72 0-73 | 6 | б КЕН фи» 
10 0.63 бә | 11 8 0-68 0-69 0-69 
| 0-69 10 0-64 0-67 0-67 


As pointed out by Patnaik (1950) the power 
tables of the probability integral of q and m 
sponding F-test above. We do this by empl 
the power function of two different tests of 
an assigned significance level с; we find the 
a selected value 1 — f; we then find the test for t] lue of d 
F(a; v4, Va) is the upper 100% % point of F for d ope ог the same value the 
valua GPE is given by egrees of freedom V4, vs, it follows that 

F(a; v, %)І (1 =; v, »,). 
probability points 


function of the q-test; can be obtained from 
ay be compared w 


Oying a genera] 


l4 тб? = 


Hence, using a similar notation for 
same value of £ is 


Pr. [J(1 +m?) > q(x: 1, DE 


he 
of q, the power of the q-test for * 


Ре, Е 
т. [9 > q(o; 1, РИК – 8; Vis va)/ аҙ v, и) 


> да 
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Results for the cases ж = 0-05, 0-01; В = 0-10, 0-25 and selected values of 1 and m are 
givenin Table 1. Thus from Table 1а we see, for example, that in testing for the significance 
of differences in means in eight categories, each containing six observations (l = 8, m = 6), 
а power of 0-90 given by the standard test is reduced to 0-87 when the present q-test is 
employed. It is evident from the table as a whole that the loss of power is small, especially 
in the commonest situations where the number of groups / is not very large nor the number 
within groups m very small. The effect of | on the power is clearly more marked than that 
of m. As m — cothe problem reduces to that of comparing the range of the group means with 


the between-groups mean square as criteria for detecting departure of the between-groups 


population variance from c^? = 0. 
2:1. A modified form of the test 
It is of some interest to determine the loss in power when only the denominator of the 
(since presumably most of the reduction in power is 


upra : 
F-ratio is replaced by a range estimator 
ator of q). The resulting test criterion is 


due to the use of a single range in the numer 
ту, — 7) / (l0), 

as F with degrees of freedom /— 1, v. Using а signi- 
the ordinary F-ratio test returns as significant 
the Incomplete Beta-ratio 


Which is distributed approximately 
ficance level ж, its power to detect а о”? which 
With power 1— р, is given approximately by 

Шт —1),30— 1) = фл, 3»), | (3) 
Where ини", Р = Fs n9) РА – f My Me) Fle vy vy). 


We have confined ourselves to the case m = 3 for which the drop in power due to the use of 
To two-decimal accuracy the results 


the ‘Studentized’ range criterion is most substantial. ( 
are the same for / = 6, 8, 10.) 


1-24 1= 6, 8, 10 

Е | i ч | | 

7 a 
“a á 0-05 0-01 
ү 0-05 0-01 ғ. % 
0:10 0-90 0-90 | 010 ed 089 
0-25 0-74 0-74 | 0:25 774 0-73 
| | | 


As expected, the loss in power is small, and there may be circumstances in which the present 
, 


Ybrid test criterion is of practical value. 
2.2. Test for a double classification 


Very similar results to those in Table 1 are obtained for the double classification 


@=1,...0 9 = 1, ғат). 

These are set out in Table 2, which is relevant to the power of the ‘Studentized’ range test 

for the existence of the и; (the power of tests for the v; follows by symmetry). m has been 
aken аз 4 7. оо go as to give F-ratios with the same degrees of freedom as corresponding 


“ntries in Table 1 with m = 3, 6,90. 
2-3. Power of modified F-ratio test for interaction 
Wes 
terion of the form А = (аша 
ely as an F-ratio with modified degrees of 
23 


xy = К +и;+% +2 


2 
In бену бена — в a ) arises through the use 


of Tange methods, И is distributed approximat 
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freedom vj, у; which, as well ав су, с», are obtainable from appropriate tables given by 
David (1951). It is clear that a power function comparison of the F,- and F-tests may be 
made precisely as in (3). We have taken two cases typical of a split-plot experiment, but 


Table 2. Power of the q-test for a double classification into 
l treatments and m blocks (random model) 


2a. а = 0:05, 1—/ = 0:90 2b. а = 0:01, 1—2 = 0:90 
т “т 
q со 
қ 4 7 со 1 4 
4 0-90 0-89 0-90 4 0-90 0-89 0-89 
6 0-88 0-89 0-89 6 0-89 0-88 0-88 
8 0-87 0-87 0-87 8 0-87 0-87 0-87 
| 
2c. а = 0:05, 1—2 = 0-75 2d. а = 0:01, 1-6 = 0:75 
an m 
ых 4 7 со 1 4 7 со 
4 0-75 0-74 0-74 4 0:75 0:74. 0:74 
6 0-71 0-72 0:73 6 0-73 0-71 0-72 
8 0-71 0:71 0-71 | 8 0-71 0-70 0:69 


similar results will hold in general. We consider the comparison of ‘error (1)’ against 
‘error (2)’, which is often carried out as a preliminary test of significance: 


1-5, т=4, п= 3 


a a 
0-01 0-05 ~ 
ғ 


1= 3, т=3, n=5 


3 үз. | 0-01 0-05 
0-10 0-87 0-88 0-10 0-87 . 
0-25 0-71 0-72 0-25 0-72 073 


— — 2... 24 
3. THE SYSTEMATIC SET-UP 
We confine ourselves to the simple classification 
ty = К+А +, (і-1, wah mL. 


where the systematic effects A; are sub 
be seen to hold for other models. 


от), 


ject to the condition ХА; = 0. Similar results will 


The case of two groups (1 = 2) need not be considered here аз the test ratio 


vm |, —z, | (e), 

and is equivalent to Lord's u-criterion appropri 8 к. 
of the difference of two means (sce § 1), Ppropriate to a two-sided test for the significance? 
For 1 > 2 по simple procedure for the evaluation of the 
We shall proceed by illustrating that, as in $2-1, the los 
denominator alone is small, and by then inves 
introduced by the single-range numerator, 


(2)is simply 


power of the q-test seems possible: 
aoe s in power due to the mean-rang? 
tigating the more important changes in power 
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From Tang’s (1938) tables it is easy to obtain the reduction in power when the degrees 
of freedom in the denominator of the F-ratio are reduced from /(m — 1) to v, to give what we 
may term the Fp-test. We give two examples in the manner of Tang's table with 

ф = /(т®А%/1). 

Example. (i) | = 6, т = 6; (ii) | = 8, т = 3. 

Values of 1—8 


а = 0:05 a = 0-01 
(i) | (i) | | | (i) | w 
Е | Еһ F ғ, | | к | ғ, Р Pr 
| —— 
. . 0-824 | 0-810 | 0-762 | 0-736 
15 | 0-738 | 0-728 | 0711 0-696 2:0 i 
20 | 0-952 2.5 | 0-974 | 0.969 | 0-952 | 0-941 


0.948 0-942 | 0-937 


8-1. The power ој the range test 
A consideration of the numerator of q alone is clearly equivalent to an investigation of an 
ordinary range test, i.e. one for which т is known. (2) now reduces to 
Alm range (2;)/0; 


à non-central range if the expectations of the z; differ. We shall write, using a dash to denote 


non-centrality, w' = range (у), 


where the y; follow normal distributions with expectations о; (where Da; = 0) and unit 
t 


Variances. 
Writing 2(2) = еті 


1 © 1 и TEE )dy ) ауы (4) 
В(и’) = A Arcem Ц, М һ— €) ЧУ» | 4% 
qm =" 
nificance level а, namely, 1—P,(w,), is а symmetric 


z^ | (27), the probability integral of w' is given by 


The power (1 — f) of the range test at sig 


funeti c ions 0. | 
E AH = 0, the power is also а symmetric function of any /— 1 of 
1 


i i ding probability integral 
the g. to be expressible (unlike the corresponding p: lity integ 
fics omc ы. Парана ми Хо2)ава function of less than these /— 1 independent 


Parameters (compare $4). | 
We therefore confine attention t А ticable 

9f P(w ) by approximate quadrature is practicable. i 
(a) The EN a single outlying mean. For this model we may write 


a; = – ul (= 1,2—1), a, = (L— 1) 1. 


o a few important special cases for which the evaluation 


From (4) we have after а simple translation 


э sti ep dE кани (уи) | ах 
Flw.) = (I- nf” «eJ. z(£) ) (|; қ, д! (и 48) а)" ra 


sents the contribution to P(w,) when the outlier 


power of the corresponding x?-test was obtained 
23-2 


The Second term is very small, as it repre 
Blves rise to the smallest observation. The 
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from the tables of non-central x? given by Patnaik (1949) and Fix (1949) and from the 
v = со line of Tang's tables, with A = Za? = (1—1) 2/1 and à = (A/l). In а few cases 
Patnaik’s y?-approximation to x’? had to be used. 


Table 3. Comparison of x?- and w-tests for a single outlying mean. 
Values of the power (1 — fl) 


Е - 
2 6 10 
1 
a ф V 
| №, ш am w № w 
| 0:05 3 0:563 0:53 0-56 — — 
4 0:807 0-82 0-83 0-76 0:80 
5 0:942 0:96 0-97 0-94 0-96 
6 0-989 — — 0-994 0-996 
0-01 4 0-600 0-62 0-63 0-54 0-61 
5 0-831 0-88 0-88 0-82 0:87 
6 0-952 0-98 0:98 | 0-97 0:98 
| 


It will be seen that except for 1 = 2, when the two tests are equivalent, the range test is 
slightly more powerful than the x?-test. This is not surprising in a test for outliers and а 
similar conclusion holds when, for example, there are two outliers, one at each епа.“ 

(6) The case of two equal clusters. For simplicity take Z even, then this set-up is defined by 

а; = – ће (j= 1,..., M), 


= Зи {= И+1, vod), 
and Z(w,) given by н 


Fea) = [^ sn (f^ pa)" (mma) as 


+ УГ. z(x — и) е #(Ё) а) į (^ 2() — и) a) = dx. 


A procedure similar to that in (а) yields the following comparisons: 


& = 0:05 
| | r= oo 
1 2 6 
| | 10 
| 
D № | 
Ф х, ш | ж w ж | 77] | 
| CREE M 2% 
D. 3 0-563 | 0-823 0-74 
| 4 0-807 0-981 | | 0.94 0:82 
| | | ТЕ | 0-999 0-98 


For 1 = 2, this model is identical with (a). In other cases the range method is less powerful. 
* Results reached by Dixon (1950) are of interest 
outliers, including the above w- and xy?-tests 
finds the range test superior for all A. Ho 
way as our ‘power’, as he discards all ва; 


В : r 
In this context. He plots for various tests 10 


жер. и, "Dance" curves obtained by sampling methods 27, 
mples ыгы таер is not measured in quite the 587? 
which z, is not the largest value in the sample- 
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4. SoME GENERAL COMMENTS 


We now i м о; e 
consider to what extent the conclusi i 
к к a » ; sions regarding tl of $3 an 
Exe : Е g g the range test . be 
зк. занн into conclusions about 'Studentized' range tests. Case (а) is of ir 7 Ји i ^, ug 
" © È 1 | А 5 
5 5 that when the models there discussed are relevant a better test than th vi à i 
i he standard 


one { i ; 
of the analysis of variance сап be obtained by replacing the F-ratio by 
- Am range (2,)/8, 
where сз = 12 
| в = J(EX(vjj — &]l(m — 1)}, 
givin се " э г 
= = Ё Studentized range test with root-mean-square denominator. The test criteri 
| а (%;)/(W/c) may also lead to a more powerful test, especially when the т кз еи 
m es of freedom in the numerator is large, but the issue is somewhat obscured b vite or 
du n power due to the use of w/c in place of s. This does not, of course ween Өз ws E 
n : i 5 і à латини 
Ноу um pr operties of the analysis of variance tests proved by Hsu (1941) and Wald = 
cA ed it shows that the non-central range distribution does not depend on а Add 
herwise Hsu's resul it i үс жергі: 
Fast. s result would not permit it to lead to a more powerful test than the 
These c i 
| omments provide no rigorous conclusions. It would b i 
dis c й е possible, followi i 
А lar to those of Hartley (1948), to obtain exact upper bounds for the ева реј sts 
p M functions of the F- and q-tests in terms of the maximum difference чы s ihe 
E. functions of the y and range tests. But as the distribution of non-central ran: Nes ке 
lable, this approach cannot at present lead us to any numerical gauge of а 2 


My thai g 

nks are due to Dr H.O Hartley for his uidance i is wi i 

: . U. 5 in this work whicl wa arried 
t at University College, London. ы —n 
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SOME SIMPLE APPROXIMATE TESTS FOR POISSON VARIATES 


By D. R. COX 
Statistical Laboratory, University of Cambridge 


1. IwTRODUCTION 


If events, such as accidents or stoppages of a machine, occur randomly in time at a true 
rate A, the number of events in a fixed time ¢ follows a Poisson distribution of mean At. 
In inverse sampling, with n fixed, the time t up to the 2th event is distributed as (2271 Xin 
where 43, denotes a y? variate with 2n degrees of freedom. Barnard (1946) has pointed out 
that the latter result leads to convenient ‘sequential’ tests of hypotheses about A. For 
example, if we have inverse samples (ny, 4) and (ng, tz) from two populations, the hypothesis 
that A, = A, is tested by referring F = тој ту to the variance-ratio tables with (2n,, 27) 
degrees of freedom. In the present note we show that, with a slight modification, Barnard’s 
tests apply almost exactly to direct Poisson sampling in which #18 fixed and n is a random 
variable. We obtain, in particular, a convenient test for the equality of Poisson means; 


although the main use of the method is likely to be in more complicated situations where 
A is the product of unknown parameters, 


2. THE BASIC APPROXIMATION 


In direct Poisson sampling in which the number of events occurring in a fixed time 18 
recorded, we have 


Е о eàt (At)r i 
prob (no. of events > п) => > prob (5 х2. < ) У (1) 


ы 
1 4 
prob (по. of events >n + 1) = prob (s x) « ) Р 


and (2) 
If we wish to make an approximation to prob (no. of events > т) in which the number of 
events is treated as a continuous variate, it is reasonable to take a quantity intermediate 
between (1) and (2). A natural choice is 3 d 


1 
prob (no. of events > n) ~ prob (а №, TS ) : 
i.e. we calculate probabilities аз if 


2At is distributed as Хаи (3) 

Thus the suggestion is that if we observe же i 4 
vent i f 
mately as if ә is fixed and 221 is dist; m e ао е жамы) с 


ributed 2 i : con 
cerned with the consequences of (3) iwn The remainder of the paper 8 a 


4(% +4)А 
ta(n +4) Fs 5 


| 
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is distributed approximately as F with (2n, + 1, 2na + 1) degrees of freedom. Thus we may 
test the hypothesis that A, = A, by referring 


1(%» +4) 
ta(n +4) (5) 


ю the F tables with (2л, + 1, 2, +1) degrees of freedom. Also (100—22) % confidence 
intervals for A,/A, may be obtained from 
ta(n +9 р. < № <2 
Ла) As 
where F, Е, аге the lower and upper % % points of F with (2n; +1, 2п, + 1) degrees of 


freedom. 

Example. Observations of the spinning of one batch of wool gave 5 ends down in 200 
Spindle hours and of a second batch 12 ends down in 180 spindle hours. 

Assuming that the occurrence of ends down is random for each batch, we may test for 


(6) 


the difference between batches by 


200 125 _ „= 
Fer ps 7 20% 


With (11, 25) degrees of freedom. The 5 % point in the ordinary F tables is 2-20, the 22% 
Point 2-56, Thus in a two-sided test the difference between batches is very nearly significant 
at 5 9. The lower and upper 23 % points of F with (11, 25) degrees of freedom are 1/3-16 
and 2.56, so that by (6) а 95 % confidence interval for the ratio of the stoppage rates is 
180 x 5:5 1 № 2 180 х 5:5 
300x125 $16 А, 200x125 
0-125 <A,/Ay < 1:01. 
The test of the difference between batches could equally well be done by the conventional 
22 method, but the provision of a simple confidence interval for the ratio of the stoppage 


Tates is a useful additional feature of the present approach.* 


9.5 


= , 


ie, 


4. ACCURACY OF THE TEST 


ted to be accurate for large samples; its accuracy for small samples 


The test (5) may be expec 5 
tion of т, n corresponding to given Aj, Ag, 


may be investigated as follows. The distribu 


t 
1 and f, is exactl. (Ay ty)" at, (Аа) 
2 Y prob (пл, То) E gu с“ mb а (7) 


е test (5), we can then find the exact probability 
П points in the critical region. Table 1 gives the 
ded test of the hypothesis A, = A, against 


уе first determine the critical region of th 
associated with the test by adding (7) over а 
Tesults of such calculations. In all cases 2 one-si' 
alternatives А, > A, has been examined. | 

һе general conclusion from Table 1 is that except when the population means are very 
Small, the approximate F test gives the probability of errors of the first kind sufficiently 
Accurately for practical purposes. For samples of the same size, the test may be considered 
satisfactory at the 0-05 level if the true mean exceeds one, and satisfactory at the 0-01 level 


© true mean exceeds two. 
Е ж Note added +. у. The derivation of confidence intervals for the ratio of Poisson means has 
е in proof. 
cently been consi B red in detail by Chapman (1952). 
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i tural to compare the approximate Ё test with the exact test of аи 

"s sig 1940), which is based on the distribution of 75, n, conditional on n pt Т. he 

kir. cis is difficult because the latter test is discrete and the true size of the A 

ч say, the 5 % level is appreciably less than 5 %, when small numbers are сине ме 

The critical region of the approximate F test appears always to consistiof the шыт „= E 
of the exact test with some additional points. It is thus roughly equivalent to Barn 


C.S.M. test (Barnard, 1947). 


Table 1. Exact probabilities associated with various nominal significance levels of the 
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approximate Р test of the hypothesis A, = A, against alternatives Ху > Ау 


(а) Samples of equal size, t, = ty 


Population 
mean, 
АҺ = Agte 


Nominal significance level 


0-1 


0-05 


e л оюк юк 


0-101 


== 


0-031 
0-049 
0-059 
0-065 
0-062 
0-058 
0-054 


0-01 


0-001 
0-004 
0-007 
0-011 
0-012 
0-012 
0-012 


(6) First sample size the larger, t, = 3t, 


ғанды; Nominal significance level 
population 
mean, At, 0-05 0-01 
1 0-048 0-008 
1:5 0-055 0-009 
2 0-055 0:010 
3 0-056 0-011 
4 0-056 0-010 
5 0-051 0-010 
(c) Second sample size the larger, lj = 34 
Smaller Nominal Significance level 
population 
mean, At, 
0:05 0-01 
1 0-012 
155 0-038 3510-5 
2 0-056 экиз 
3 0-059 0-002 
4 T 0-010 
0-059 
5 0-057 0-013 
0-012 
pe se 
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The approximate F test is an interesting consequence of (3) and may sometimes be pre- 
ferred to conventional approximate y? methods. The detailed discussion does, moreover, 
Show that (3) may lead to accurate results even in very small samples; in $7 we shall consider 
àn application where a great simplification is achieved by the use of (3). 


5. CONFIDENCE INTERVALS FOR A SINGLE MEAN 


The approximation (3) may be used to test the hypothesis that A = Ay, or to obtain con- 
fidence intervals for A. An interval of confidence coefficient (100— 22) % is thus given 


approxima 7 2 2 
Pproximately by Фуа <А (8) 


where X? | are the upper and lower  % points of y? with v degrees of freedom. This may 
be compared with Garwood's (1936) confidence interval, which in the present notation is 


0.2 < AS Ааа, + (9) 


The interval (9) has a confidence coefficient of at least (100 — 2) % whatever the true mean 
At; the true confidence coefficient is a serrated function of At which, for small At, has most of 
its values appreciably greater than (100 — 22) %- . 

The interval (8) is always narrower than (9), and the true confidence coefficient is some- 
times less than (100— 22) %. In many practical applications it would be reasonable to 
assume that over a number of applications of the method the true means At are distributed 
randomly with respect to the serrations of the graph of the confidence coefficient. In this 
Саве it would be justifiable to use a confidence interval for which the average confidence 
Coefficient over any fairly small range of At is at least (100 — 2%) %. The interval (8) appears 


to satisfy this condition provided that n and с are not very small, and it might therefore be 


Claimed that (8) is preferable to (9). The point is, however, of little practical importance 


because the reduction in width from (9) to (8) is only small. | : 
Example. Seven stoppages of a machine are observed in a certain period. 90% con- 


здеп етік forthe population number of stoppages are, according to (8), (3-63, 12-50) 
and according to (9), (3:29, 13-15). 


6. RELATION WITH INVERSE SAMPLING 


The close connexion between the tests given in$$3and 5 and those based on inverse sampling 
as been mentioned briefly in 551 and 2. This connexion will now be discussed in more 
detail, + | 
Tests based оп the measurement of the intervals between randomly occurring events have 
cen described in full by Maguire, Pearson & Wynn (1952). In емін their test for the 
Significance of the difference between the rates of occurrence in two ser ies is as follows. Let 
"a % be fixed and let t, tẹ be random variables defined as in times in the two neries up to 
е 7, nth events. Then F = тт may be tested exactly asa i ratio with 
(2n,, ің) өрте of freedom. The test of § 3, on the other hand, is that i һ tare fixed dinem 
ànd if random variables та, 7 are defined as the numbers i — rie ше h Hmm а A 
hen p. t (n, 4- ыт” 1) may be tested approximately 


(n, + 1, 2n, +1) degrees of freedom. 


* for some helpful comments leading to the addition of 


thi Tam very grateful to Prof. Е. S. Pearson 
55 section, 


358 Simple approximate tests for Poisson variates 


In the first case the interval /, ends with the ,th event, but in the second case this is not 
in general so. The extra degrees of freedom, and ће 4’s in the definition of №, may be thought 
of as accounting for the periods between the last events and the close of the periods of 
observation. In many applications in which 5, (аге fixed, the precise instants at which the 
па, Noth events occur would not be known. 

It will frequently happen in applications that neither ¢,, t, nor n,, n, are fixed in advance 
of the observations. If, however, the periods ¢,, t, are determined by some random process 
that is quite uninfluenced by the numbers of events occurring, then conditionally оп ty, tz 
the quantities 74, n, follow Poisson distributions and the approximate F test may be 
applied. Similarly, if the time intervals are measured up to the л, noth events, where 21, Па 
are determined by a random process independent of the observations, then the exact F 
test with (2n,, 2л) degrees of freedom is applicable. 

There are many other possible ways in which 4, tą might be determined; for example, 
1, t, might be chosen by some random process correlated with, but not completely deter- 
mined by у, пр. In such cases it is not possible to find the properties of the above tests 
without special investigation, although a reasonable general rule is to use the first F test, 
with (274, 2n) degrees of freedom, whenever the intervals ,, ty are ended by events, and the 
second test otherwise. This rule breaks down in extreme cases; for example, if t}, t are deter- 
mined by a likelihood-ratio sequential test for comparing the rates of ion it would 
clearly be entirely inappropriate to apply either F test, 


7. THE LOGARITHMIC TRANSFORMATION 

The basic approximation (3) is very conveniently expressed in logarithmic form, and 8° 
may be expected to be useful whenever A is the product of unkno fri 

wi is any 

function of n we may rewrite (3) a le 


ћ 


lur = log A+ logf, қ log ЗА. 


The log x? distribution has been studied in detail by В b 
Wishart (1947), who have, in particular, shown that кашы. кай prm uu 


Blog 3x; = Vv) varlogły? = ЖЕ 
where у, У" are the digamma and trigamma functi i 
Ons. i | 
logf,/t is ап unbiased estimate of logA. Thus we ao jon kr ea че 
present purpose it is accurate enough to write о куше " 


о = O14, ја => (m = 1,2,...), (10) 
Also var log! = (пф) 493 (n = 0), 11) 
| ln (n= 1,2 ) } ( 
Thus if we define a transformed variate by јање 


" 0-14 
bed. S Yi s 


^ pa a2) 
ES log; if n + 0 


—-°С—+є—-_-__-—-—— 
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then approximately Е(г) = 1054 and varz = (logoe)? v; 
where ъ= 493 (n=0), 
= (n+ x = 


One use of the transformation will now be described. 

Suppose observations. of stoppage rates are made for k machines on each of which stop- 
pages occur at random, the rates for the different machines being different. Suppose that 
а change in the process is introduced and that fresh observations are then made. In some 
cases it would be reasonable to expect а constant proportional change in stoppage rate, 
1.6. to expect that if the initial stoppage rates are А), ...,A, the final stoppage rates will be 
РА, ...,pA,. If the observations are, in the initial period (n4, 4), ..., (т. ¢,), and in the final 
Period (nj, t), ..., (Ni tk), we may define transformed variates 21, ...,2, and zj,...,2; as in 
(12). Then u; = z; — z; has mean д = log) and known variance (109106)? (v; + v) = (logy €)? w; 
say. The best estimate of д is thus 


YXu,w;! ^ (logue)? 
ene. VANS aa 
шені” B= "Suz? (14) 


An almost unbiased estimate of the parameter p is 
7] | 1 : | 104 
‘= Xw; d ( 1 5) 


An approximate test of the hypothesis of a proportional change in stoppage rate may be 
Obtained by calculating 
d xia = (ep 09) loge)? (16) 


Example. The following data were the results of a sampling experiment with p = 2, 


the values of Т, Т being chosen arbitrarily. 


Initial Final 
$ Uy 1ш; 
т, (hr) n; Т: (hr-) ni 

2 28 5 0-428 1:429 
i rhs 4 41 21 0-522 3-360 
3 21 2 24 3 0-118 1-200 
45 18 16 1 0-235 6-828 
5 30 4 32 12 0-449 3-000 


Zup? = 15817, Хирш" = 54611, Xulw;!- 21774. 


The last two columns give 


ту. —1 = (v, Hv! = n,ni[(n; +; 
п, = 6—21 = logo Teil Pim and wj!- (v+; an| (n; + mi). 


n 
9 frequency being zero. 


дела иг" = 0945 and var ft = (0-4343)?/15-817, 
= d" 
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so that standard error (0) = 0-109. There is thus good agreement with the true value 
и = logy) 2 = 0:3010. p is estimated by 


The hypothesis of a proportional change in stoppage rate is tested by 
Xi = (2-3026)? Улог (и; — 7): = (2:3026)? (Ур tu} — (Улог 1) д} 
= 1-55, indicating a good agreement. 


This problem could be tackled without introducing the device (3) by the method of Dyke 
& Patterson (1952), using an iterative solution of maximum-likelihood equations. The 
present method is considerably quicker. The methods are probably asymptotically equt- 
valent when the n; and ni are all large. The transformation (12) can be applied in 2 
similar way to more complicated problems involving Poisson variates, 


I am grateful to Mr D. A. East and Miss Patricia Johnson who did most of the cal- 
culations. 
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ORTHOGONAL POLYNOMIAL FITTING 


By JOHN WISHART anp THEOCHARIS METAKIDES 
Statistical Laboratory, University of Cambridge 


1. There are considerable advantages to be gained by using orthogonal polynomials 
when fitting curves to data by the method of least squares. The computer whose data are 
equally spaced, and of equal weight, is well catered for by tables, the best known of which 
are due to Fisher & Yates (1938-53), extended by Anderson & Houseman (1942); similar 
tables, due to Van der Reyden (1943), have been partly reproduced in the new Biometrika 
Tables for Statisticians (Pearson & Hartley, 1953). In the general case of unequally spaced 
and weighted data similar tables are impracticable; what the computer needs is a good 
calculation scheme, so designed that it makes little appeal to the memory, or the mathe- 
Matics, and is as far as possible self-checking. There is only one mathematical basis for the 
Methods, but a number of computing techniques have been devised, chiefly in order to carry 
Out the inversion of the matrix whose elements are the equivalent of the variance and 
Covariance estimates in the parallel problem of multiple regression. The only scheme known 
to the authors as having appeared in Biometrika is due to Isserlis (1927), but this needs 
revision. Elsewhere there are protagonists of the Gauss-Doolittle methods, described in 
More than one text-book; of the pivotal condensation method due to Aitken (1933, 1937); 
and of the Choleski method dealt with by Turing (1948), Fox, Huskey & Wilkinson (1948), 
Fox (1950), Fox & Hayes (1951), Rushton (1951), Hayes & Vickers (1951), and others. The 
- 1950, 1953) deal with the mathematics of orthogonal polynomial 
ation scheme is given in the former, consisting of a combina- 
tion of the Doolittle and pivotal condensation methods. (In fact, the available methods 
Seem to differ only in the order and arrangement of the Ann seem: Е 

Тће purpose of the present paper is to supplement the Biometrika Tables by offering 
а single recommended computation scheme for the general case. Like some other schemes, 
it is set out in tabular or ‘blank’ form, for ease of general manipulation. It is approached 

Tough multiple regression analysis, and to stress this fact the standard regression notation 
has been used. ‘The arrangement is such that every thing that сап possibly be wanted is 
obtained within the tabular scheme. One point which may be considered convenient, ев. for 
Plotting purposes, is that while the orthogonal polynomials are calculated, the result is also 
Blven at any stage as a simple polynomial. One of the authors has examined the alternative 
Schemes in application to the illustrative example given later in the papar, and the con- 
clusion is that the method and scheme now offered are the most convenient for use by the 


recent, papers by Guest ( 
fitting, and a practical comput: 


а 
Verage computer. 


2. We begin by outlining a useful method of conducting a multiple regression analysis 


Which brings to account one independent variate at a time, so that the successive terms of 

е fina] regression equation are orthogonal to one another, although it can in the end be 
Converted to the usual form. Although well known, computational details of the method 
are not very readily accessible. They were published a few years ago by one of the authors 


(Wishart, 1 950), while Woolf (1951) later gave practical directions for regression calculations 
by successive АВИВА of additional variates, using throughout the Gauss multipliers. 
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Consider a dependent variate z,, usually assumed for significance test purposes to be 
normally distributed, and independent variates z,,2,,...,v, ,, usually assumed (for the 
same purposes) to be fixed; these are intended to be dealt with in the order stated. Les 
w denote the weight to be given to a sample observation. Add an additional ‘independent 
variate v, having a fixed value unity. Let X denote a sum of weighted products (or squares) 
of the designated variate measures taken over the sample, the size of which is N. 

For p, q = 0,1, 2, ..., we write 


Spa = Х(шш,24), (2-1) 
and then calculate in succession the partial sums 
Spro = бы - 800/800 (р = 1,2,...;n), 
брод = Sp20—-Sp1-0S21-0/ S110 (р = 2,3,...,n), (23) 


Sp3.012 = 8301 — Syeei 8201192201 (p = 3,4, тайт), | 
10103 = 64012 — ©рз-о1ә бз-отә/!8зз-о1ә (p = 4, “+ 


and so on. The displayed formulae (with n = 4) carry us as far a: 
apart from zy. We have in particular 


, 


8 three independent variates, 


Soo = E(w), & TE X(wz,). 


The estimated regression coefficients, of progressively increasing order, are then calculated 
in turn, thus: 


бю = Snol Sos 
Pro = 80/810 (2:3) 
база = n201/ S2201; 
bison = Sns-or2/ Өзі) 
and so on. In particular bno = X(wz,)|X(w) = gs 
Partial z-variates form the next sequence, and are caleulated ag below: 

To = Xp — Spool Soo; 

ро = tpo 7 ш] бу, ee 

Брдо = Тр — 20120118301, 


and во on. The first of these is 20-2,-2,, 
The multiple regression equation, written аз а sum of ort 


X, 


hogonal components, is the? 
= bno Tot baro Tio +b. 


2:01 5.9, + ТА зоо +... = 


earlier stage, also that at any stag 


t this test 
tested against the residual mean 


ed 


Could have been applied 
Square available at 


че а single regression соейїсїё 
that Stage, 


Emus... ——— ——— 
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Table 1 
Regression Degrees of Mean 
coefficient freedom Sums of squares squares 
bno 1 бо по = ла — Sanno 
baro 1 В, 1-05 по = Бањо – Sano 
n2-01 1 bus as = Жалт Sano f ањо Sanos т, 
buses 1 29-0129 n3-012 == nn01277 Samos 
Residual N-4 Snn-o123 Y, 
Total N San 


In practice it is helpful to choose as x, that variate which is most nearly expected to be 
associated with x,; then as х, the next in expected strength of association, and so on. The 
Calculations can be reviewed when a fitted term is found to be insignificant; the appropriate 
variate may be discarded and another substituted. As a result of this procedure it may be 
Possible to say with & minimum amount of labour that x, is adequately ‘predicted’ by 
а limited number only of independent variates, a matter of some importance in a variety 
of problems, for example, in physical anthropology where large numbers of body and other 
Measurements may be available, and for practical reasons it is desirable to deal with a small 
number of variates only for purposes of description. 

The more symmetrical arrangement of the regression equation, giving all the partial 
regression coefficients of the highest order, and also tests of their individual significance and 
В: final check, require additional calculations which can be readily inferred from the applica- 
tion of the method which is made in $3 to orthogonal polynomial fitting. 

It will be noted that the first stage in the scheme involves the determination of the 
Weighted sums of products (and squares) of deviations of variate measures from their 
(Weighted) means. That is, Spgo is determined from S,,, by removing the usual ‘correction 


ра 
Actor! S, S 2/5. If the quantities Sj, are given in the first instance, one stage in the 


computations is eliminated. | ) 
. 3. Turning now to the problem of orthogonal polynomial fitting, we read т, as y, to 
Stinguish the dependent variate. We then let 


я аай 
ар = 20 = 1, тү = жї ==, 2. = T“, ..., 


So that the independent variates become the powers of v, including the dummy variate. In 
© special case of n = М, the sample size, so that there are N — 1 independent variates, the 
Multiple regression equation will then become the polynomial of degree N — 1 of perfect fit. 
€ usual case is, however, where we endeavour to fit a polynomial of degree p, where 


РЕМ 1, We then have 
8,5,0 = Зна > 7 
Where » and в take independently the values 0, 1, 2, 
б = Х(шул”), needed forr = 0,1; 2; «3P (3:2) 
.,p as well as for n = y, and we shall also find 
.. which are defined below. In particular 


Qmm = Seres = (02745), (8-1) 
...,p. We define also 


Convenient to calculate the quantities бра» 63012» - 
7018 the mean value of the 7th powers of the 2’s. 
In Virtue of (2-3), equations (2-2) may be calculated as 
— 6,010 (r = 1,2, ..., p and y), | 
(ғ = 2,3, ...,р апа у), (3-3) 


(r = 3, ..., p and y), 


Equation; (2-3) will be needed for n = I, Dy 


Sao = Уа 


8,501 = 20 бло Soro 
S,3.012 = ©7301 — бо S3201 
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and so on. Equation (2-5) becomes, in the present case, 
У = byte +byrotro+by201 €201 бұзба воле +... (3-4) 


In Fisher’s notation (Fisher, 1925) the successive orthogonal polynomials are & = 1, £1 
Бр, ..., with multipliers A, B, C, ..., so that the fitted polynomial is 


Y = А+ ВЕ ++ Dét... (3-5) 
From equations (2-4), by inserting b-coefficients, we have 
210 -21-б, 
Тәр = Tao — бой, (3-6) 
23.012 = ®зоу — 32.91% 2.91; 


and so on. In virtue of the relations 


E= by, 208 = 05, Е... 
bao — бобо = booa; | 

bao — 531-9910 — 632.01 бода. = бар; (87) 
бзо- бару бол = Базов, 

and so оп, we find, after appropri 


ate substitution and simplification, replacing to, $4, To: 
by 20, 21,22, ..., that 


20 -%-і, 
to = = – +, (3:8) 
Фо = 55 = — b1 — barot +22, 
T3012 = 53 = — бол — бара — Dao, ж? + 23, 
and so on, and that Зуи чын эн 
A=by, B= бл» C= баю» D= буво» +++ ~ 


By combining we may write down the different terms of Y 
cumulated one at a time, thus having the appropriate fitted cu 
a simple polynomial in 2, as follows: 


Y, = byos 
Y, = бол + by 1.92; 


separately, or they may © 
rve at any stage expressed 28 


3.10) 
У, = bas Боз + бо 22, | 


Үз = буолоз + 61.0232 + 6 зола + by 3.012%, 
and so on, where У, denotes (3-4) taken to degree ғ, The coefficients are calculated fro 
(2:3), putting n = y, and from (3-7) with y written for 2,3.. 
The analysis of variance will be as in Table 1 with radius imated 
у sub; tima 
variances of the coefficients of the power series i desi sue id p 


с eries in (3:10) ma i in terms 9 
the following ‘Gauss multipliers’: шыны 

Co = So, 

боол = Coo — Big Cy, C119 Siw 


бола = Сора — b 
00-12 00-1 -1 29. = 

20:1 620.1: Сп. = су Doo C319, а) 
00-123 = боол — Б, и 


3012039415; = с = 
11:02 — 631.0 31-029 


= #= 
C2201 = 810, 


C2013 = Сы — 
22-013 = Сә b3201 C3201 633.012 = 551, Е 
0123 


2 
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in which 
Cio = —610611-05 Сара = — b201C2201 Coro = — боуобозо1› C3012 = — Өзу1ә©зз-012› 


631.02 = —931-02C33-.012» C3201 = — з2.01С33013; 
and so on. If we denote the ‘mean squares’ by 
ag рој 
Бро У — 1) = 5 у UN 2) =; 


2 
pe 


then, V denoting ‘estimated variance of’, we have 
2. 
V(byg) _ = ©0080: 
2 
V (byo) = Coo Si V(by1.0) = 011.081, 
У(буола) = 6001252» V(byr02) = Сл1:025% 


2 2 
V (60-123) = Coo12353: V (by1-023) = 61-02388, 


2. 
(бузоу) = Co20152: 
2 p Бе; 2 
V(by2.013) = C2201383: V (bys.o12) = 3301253: 
and so on. Finally, it may be useful to have formulae for the estimated variances of the 


A. its 
ordinates of the different order curves, Calling these ordinates Y, У, ..., we have 


V) = соё, 
vÊ) = (Coo £8 + Слао 51) 81, | (818) 
V (Yo) = (со + со 1+ C2201 23) 85, 
У) = (со + Cio Si + 2201 £3 + C33.01283) 88, 
and so on, in which, of course, & = 1 and the values of &,, £j, ... are obtained from (3-8) for 
any value of 2. 

4. For practical computation purposes it is desirable to systematize the evaluation of 
the quantities set out in succession in $3. This leads to the tabular scheme set out in Table 2, 
Which proceeds as far as the fitting of a cubic, and can readily be extended for higher degree 
Polynomials, Owing to the symmetry of the original matrix of sums of squares and products, 
Опе need only enter quantities on and to the right of the diagonals. This is indicated by the 

балу vertical lines in the upper part of the table. A small number of supplementary lines 
are needed to obtain the actual polynomial coefficients; these are on the left immediately 
elow the main table. The column on the right is used for the various contributions to the 
Sum of squares, and from this the analysis of variance is readily written down. The check 


Column is of the usual character for this type of work. The operations of the rest of the table 
tick mark is shown the check is that the 


are followed here, and then in those lines where a А 
number in this column is the sum of all numbers in that line to the left, stopping at the 
балу vertical rule. A useful final check is shown in the lower box lar the right. The table 
Shown in lines 1-16’ is a subsidiary one, giving the matrix of the c’s, inverse to that of the 
з. Only the diagonal Св are required for the estimated variances of (8-12) and (3-13), but 
the Whole matrix has been included for the sake of completeness. It is useful to have all 
е св readily available, particularly when the scheme is used for multiple regression 
ing adjusted coefficients of the surviving inde- 


(See § i lat 
2), since they are needed in calculati : : : 
Pendent bis! 1 E thi redoing the calculations, when it has been decided to omit any 
es, wit: s purpose were given by R. A. Fisher in the 


Such varj ; lae for thi 
ariate, Th -opriate formulae i 
edition of „т Methods for Research Workers, and discussed by Cochran (1938). 
24 
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Table 2 
Check 
column Sums of squares 
8, 8: 8, Syy 
1 Soo Sto 20 30 уо y! 
2 = -bio -і” -ba -byo Мобу 
Б] Sn Sa бу, 
4 = bro — bio Sto = о8о = оу 
5 — bi Siro Sao Syro 
6 со -1 = -ӛле було Syro 
1 Soo Sy; 
8 = bzo PLI "LET 
9 СЕ — bao MULTI = baro Syro 
10 — фра = дао зет бұза 
п Co Carso = бао узла АСЫҒА 
12 S Sys 
13 — bso = 050450 = bao Syo 
14 | оой =bayi бо атысы 
15 бет 2041 баба о 7 932-01 932-01 ҮЗ 3101 Sy2-01 
16 = baoa = baroz бале Syst 
1T бара Сал ог —1 — by yor by sns ни 2) 
18 byo — био 
19 — вала було | МЕ 
20 буол було pe | 
21 £x Lt = Ро буг byoass Sy" 
ІШЕР m Bien гі 
23 7 була буз-ота 7 Өзг узео = baror 22223 Имен уз 
24 byoazs бл оза bysora =A 
17 Co Final check 
2’ БІЛІГ) 
3 Соо-1 
a = фар C201 
5' oo-12 
6| Бао.дасаола 
т Coo-123 
8’ Сі Со 
9 -Әдесея 7 ба-о6з1-0 
10” Слог Сіз 
11/ ЕСЕЛІГІН 7 дат од Сапа 
12’ 10:23 11-023 
13 РІ Са Сао. 
14' | = буду бола — 939-01 Сп өз E Dies ісі 
15 pm nos 6224 
16’ ГАТ сз. 
2032 31:02 59.03 бз 


Subsidiary table giving the св. 
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Table 3 
1 14 = 4. 808 7105 78 8691 538 
dem XU 51:351148 — 550.351143 -5-571429 -62078571 v| | 4345715 
3 = = ап 589 86635 
4| -65 805 -500825 -507 -564915 
—— о — 
5| 16 “as 273645 82 3014385 v 
S - | 
0.030780 7 — 120-382979 -0:387707 -14252246 y 31-7920 
= 99501329 a 
LE. "HN ET 43-8573 308190030 
ГЫТ ВБ — 441036187 — 4473-857: = f 
О 2352 963-6154 | - 354930500 
3180-1824 40602821 -3868:5027 49394501 v 
u 000572834 саны m] P "18623580 0178197 | -1944530 v 69-2300 
= U о Ја ар ~ 
ЕС. es 555 46207 9491143 
5 Зо Poe E -478315393 
= 550-357 145 Z 10609- — 3900055-8: 
M | 80:060361 — 120362010 8 7235-315 —789531-16 
~ 354-395389 218-860190 — 18:623589 „эсс 
16 == бете —94:946 1839548 у 
~ 63-763168 89-477211 — 18:623589 
a ne a = = 22] i 5 5 — 0-99. 5 2 
МЕСТ T "000100790 = | 0005135 4865 у 04875 
18 536-0810 
10 5511429 5 
2520096 0:387707 
0) Sous Зан 09583 
051; 3877 -0 
Bll аршы bom — 0178107 11911177 
2) = aT = 410-8707 
33|  -0:839616 2-481838 mH = 287-2729 
МЧ 0327424 — 0-459405 0-005632 ЕТЕНЕ! 
| 5---- wees 089565 5 8 
~0-012299 2.022313 — 0:0825 


v 0:0714986 
" 01997638 
TC 
4 092711994 
0-1660949 
V. тг 
у 4372873 
O-2198841 
T кем. 
90571714 
8 
%| ~0-0307399 0:0047281 
i 291025736 0-0633452 
V| -01334065 
= 0-0680733 
ly 0'3085577 0-4329902 
Wee 04418649 0:5010635 
ly 0-00; 0:000458677 
872834 — 000539027 
Ш| 0084999559 9.09012160 0:018757678 
= 
is 007995066 —0-09551187 0-019216355 
= " 00000540821 
000344845 0:00183911 = 0:00100720 
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Little explanation is needed of the operation of this scheme. The primary 8% (from ( 3-1) 
and (3-2)) are entered in lines 1, 3, 7 and 12. To form line 2, all quantities in line 1 are divided 
by 5, whose reciprocal is Сор, a quantity which is at the same time entered on line 1 а In 
this and subsequent calculations the signs are changed to the right of the heavy vertical 
rule. Line 6 is similarly obtained from line 5, line 11 from line 10 and line 17 from line 16, 
by dividing by 8,10 55; and Sz3.012; Whose reciprocals are бү, Созо and 633.912 respectively. 
Line 4 comes from line 1 as shown using the multiplier — бу; the next line is the sum of lines 
3 and 4, as is indicated by the short horizontal rules. At the next stage lines 8 and 9 are 
derived from lines 1 and 5 by the use of b-multipliers already determined in lines 2 and 6i 
line 10 is another summation, this time of lines 7, 8 and 9. And so on. The appropriate b’s 
and c's are entered in the supplementary lines and subsidiary table as calculated, the 
subsequent calculations in these lines being as shown; vertical additions are made in pairs 

of lines, for example line 20 is the sum of lines 18 and 19, line 3’ is the sum of lines |’ and 
2’, and so on. 

We are only concerned in this paper with the fitting of orthogonal polynomials, and the 
computation scheme has therefore been drawn up with this end in view. But it should be 
pointed out that the scheme generally is one for matrix inversion, at any rate where the 
simultaneous equations whose solution is desired are symmetrical, and it may be found to 
compare favourably with other schemes which have been advocated in the past. It requires 
no adaptation to be used in standard multiple regression work, as described in $2. 


5. Numerical example. We shall fit as far as the third degree polynomial the followin’ 
data: 


Weight (w) 1 2 


1 1 2 3 1 2 1 
Values of ж 0 1 3 4 7 8 10 11 12 
Values of y 0 2 5 6 9 8 7 5 4 


Since for a polynomial of the pth degree we require the weighted sums of the first 2P 
powers of x and the weighted sums of products of y with the first p powers of 2, as well as 
the weighted sum of squares of y (for the analysis of variance), the first step consists in 
calculating these quantities, either by means of a tabular build-u 


5 о 
powers. The calculation yields Pac е 

би “лм = 14 би = (шу) = 78 

Sto = Sor = Х(от) = 91 Sn = (ух) = 589 

Soo = Su = Soo = X(wa?) = 803 б,» = У(шул?) = 5049 

Ви = Вы = ба = So = (wa) = 7705 Sys = Х(шуаЗ) = 46207 

Bn = би = Sis = (шй) = 77447 

Saz = Sos - X(wa*) — 804121 

б 


= У (26) = 8555663 бу = X(wy) = 538 
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The fitted curves of degrees 0, 1, 2 and 3 are: 
л = 557143, 
У, = 3:05133 + 0:387712, 
> = — 0-33965 + 2.481842 — 0:178202°, 
Y, = — 001222 + 2.022372 — 0-082562? — 0-0051423, 

The analysis of variance need not be separately written out. The residual sum of squares 
from Table 3, is 538 — 536-0810 = 1-9190, with 5 degrees of freedom. 83 is therefore 0-3838, 
whence it follows that the linear and quadratic terms, with mean squares 31-7920 and 
69-2300, are significant at the 0-1 % point. The cubic term, with mean square 0-4875, is not 
Significant. If this were a practical problem we should either go on to one more stage, or, 
if that was not thought worth while, rest content with describing the data by means of the 
above quadratic У. This may be plotted against the actual data by substituting for 2, when 


we get the following results: 


Weight (w) 1 2 1 1 2 3 1 


ә 
Values of у 0 2 5 6 9 8 7 5 
Values of У, — 034 1:06 5:50 6-74 8:30 5-11 6-66 5:40 3-78 


sł will be (0-4875 + 1-9190)/6 = 2-4065/6 = 0-4011. This is multiplied by сл», сүү» and 
Съз оу respectively, and the square roots taken, to deduce the standard errors of the coefficients 
925 бег ANA бр» оу respectively, of Y,. These standard errors are 0-4188, 0-1652 and 0-0136 
respectively. From these it appears that буо» and брзо are large in comparison with their 
Standard errors, and very significant, but that 5,45 is not significantly different from zero. 
The calculation of the estimated standard error of an estimated Y, may be illustrated by 
taking the value 6:74, which corresponds to 2 = 4. 85 = 0-4011 has to be multiplied, as in 
3-1: ° ° 
(8-18), by Coo 8 + Cao 1 + C2201 e 
in which the с’з are obtained from Table 3 and the Гв are given in (3:8) in terms of the b’s, 
also obtained from Table 3, and x. Putting 2 = 4, multiplying the above expression by 
04011 and extracting the square root, we find the estimated standard error at this point 
to be 0-26. 
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POPULATION DIFFERENCES BETWEEN SPECIES GROWING 
ACCORDING TO SIMPLE BIRTH AND DEATH PROCESSES 


By J. H. DARWIN 


1. INTRODUCTION 


In some ecological problems, treated especially by C. B. Williams (e.g. 1944), the numeries 
sizes of different species of plants or animals present in a sampling region have followed 
Fisher’s logarithmic law, viz. the expected number of species having an observed population 
size nis ил" (n = 1,2,...). In thisaisa constant depending only on the biological 28500187 
tion being sampled and not on the interval of sampling. This law for the observed species 


was derived from the negative binomial distribution, for which the probability that а 
species has n members is given by 


_ Tn k) с" 
n^ Г(т+ 1) Г(Е) (1 Toc 


by allowing the parameter Ё to tend to 0, and the number N of possible species to tend to 
infinity in such a way that Nk = а, (1+0) = x. The negative binomial is a compoun 
Poisson distribution, i.e. it arises from the assumption that the population of each Vw 
is represented by a Poisson distribution with mean m, the different m’s for the differe? 
species being regarded as a random sample from the distribution 


(n = 0,1, ...); 


(1/с®*Г(Ё)) 28-1 е2 dæ, 
Thus if there is a large number of species, most] 


sample, Fisher’s logarithmic law might be a r 
between them. 


е 
y rare, in danger of losing a member to En 
easonable description of the heterogene! 


randomly scattered in the interval (0, t) 


| 
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model for the changing number of species is applied to species which expand geometrically, 
and logarithmic type distributions of the observed population sizes of the species again 
occur. 

2. FINITE DISTRIBUTIONS 
Tf the size of the population can take any of the values 0, 1, ..., №, the equations of the birth 
and death process are 


ІР, < 
Е - ба Бағ = (а, tB) B, + Bua Bi (п = 1, 2, recs N- 1), 
(1) 
ар, ару 
= = q, P — hoPv TUE — ах Py Êy-1 Pas 
in which P, is the probability that a species has size n (n could be the number of regions out 
of a possible N in which the species could be found); 2, is the probability death-rate and 
Ên the probability birth-rate when the size is n. 

2-1. Transition probability rates not varying with time. When а, and f, are constants, 
Which is, for instance, reasonable when the spores or seeds of a plant or the offspring of an 
animal can travel over distances of the same order as the dimensions of a region, the set of 


equations can be written as dP 
ІР у 
— == , (2) 


Where P is the column vector of probabilities Po, Р,..., Py, and A is the matrix with non-zero 


elements only in the three main diagonals 


Bo = 
— po о; + Вл og E 
or Ay ы 
ayatna - Фу 
— PN- Фу 


It is well known that if the latent roots A; (i = 0,1, > N) of this matrix are distinct, the 


N * ; 5 
Solution P, is of ће form X Ам“, where the А„, are constants independent of time and 
i=0 


embracing the initial conditions: It is further possible to show that if ол, ON ON UT 
are > 0, these roots A; are real, distinct and positive (a result also established by B. J. Pren- 
diville, unpublished). This follows from an expansion of the leading (n + 1) x (2 + 1) minor of 
— A | as D,,(A) in the form 
РА) „= (A ж (са TÉ) D,4() ~An Bua D, (A). (3) 
An induction hypothesis that D, (А) and Р, (4) have real distinct roots Hoo Pas «e fna 
Vo, 9, пора; Such that Ko < Vo < #1 < -= « Vy a < [6 leads to the conclusion that D,(A) 
ав real distinct roots separating those of D, (А). An expansion of | АТ— А | shows, by the 
Sign Tule, that there can be no real negative roots, and that the lowest root of. D, (А) for 
<N is 50. The result then follows. It can be used as a criterion for testing if possible 
Processes of this simple type could lead to given distributions. 


hus, i 
Ж” АҺ. Pags ала Pra = (0 Р 
t 
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(1) becomes, when P(t) is not identically zero, 


(4) 
ГАО) == уља ћу — (4, tf) “В-ға, 
This is а set of N + 1 equations in the N + 1 coefficients of e-^i, " 
ШТ” N-n, since Py = рУ must be о 
i i = р" (1— р)", since Py = p 
2-2. Examples. In the binomial, P, ( а) р" (1-р) pe: 

form У 4,674, p can only have the form a +be, with 0<a+b<land 0<a<l. 

= ni 2 

i=0 


- Е (a) [np*- (1. р)" — (N —n) р — p) N71] ( — bp e) 
1 т 


N-— 
= -ube (2 е) р, 
_N-n p 
and Та = "ell 


Whether or not a solution is possible can be seen by substitution in 
sets of equations 


(4). This produces the 


N-n ЕЕК ИЕ cat ей (5) 
з Па за ур а) , 
N-n, 2. Е _ (6) 
баг 2a— (+ Ba) (1 — 2a) - Ва 2 n j^ —а) = u(Na —n), 
N-n n (1) 
Anti n+l + (5 B) B, = AN. 


N — (n—1) 
These provide 3(N +1) equations for 2( 


ге not in 
N +1) unknowns Ans В,, but the sets are по 
dependent. The unique solution satisfyi 


ng them is the well-known one 
фу, = от, В, = (№ т); а = В8|(®+8), w= (a4 f), 
and 6 is determined by the initial conditions, 
Polya (1930) considered the compound binomial distribution 


= үү n p yN-n g=] -pya 
Р, = | МЕ (1-2) `` В.) — 
z (7) ala 4-1)... (афт (2+1)... (BAW 1) (8) 
n + e+ B+)... EENI 3 
In his work it arose from а contagion urn scheme in which, after each of N drawings fro™ с 
urn containing « white balls апа / black, two balls of the colour drawn are replaced. P, 
then the probability of obtaining п white balls. Skellam (1948) applies the distribution | 
the secondary association of chromosomes in Brassica, and to accident and traffic proble” > 
It can also be used as a description of the Spread of species Over N regions when the рор"! 
tion size of a species increases by one for each new 


region j ih e hetet? 
geneity in the binomial then describes that between паа ened .... specie 

The form of P,, as far as the constants æ and 3 are concerned is as in а hypergeom? : 
distribution, so that the present result will also apply to that. › 

The only rational form оба and В that will not, make Р an infir ite series in powers 0189 b 
e isa = a5 and в =p Ce, where 0<а,5<1 pue c ees hee and — 24 е 
1—aifc<0. 8 і Consistent, for бб be^ the spread of a 8р d 
Зе process, 


pe 
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The equations similar to (5), (6) and (7) are now ЈУ + 1 in number, the simplest to find 
coming from the coefficients of 2799, e7Y* and е“. They are: 


tnla +n) (а n — 1) (N 2) (Y — (n — 0) 
— (a, 4- B,) (0+1) (N — (n— 1)) (6 + У — (%+1))(а@+— 1) 
Ban r1) У —"п) (6 + У —(п—1)) = 0, (9) 


&, UN —n) (N — (n — 1)) + (2, +6,) (8 1) QN — (%—1)) КВ 100+ 1) 
--(Қа-1)(Х-(а-1)), (10) 

ап) (N — (n — 1)) (а +п) (а +0— 1) [ya +n- 1) – (a) - Wb + N — 0-1) + V(0)] 

— (04, + B,) Qi 1) (N= (1 — 1) (ав) 0 N — (n 1) 

x [V (a 4- n) 2 (a) — (+ У —)-* V(b)] +11001) (+ —п) (6 N (0—1) 
x [V(a- n —1)-v(a)- V(b--N —(n— 1) + v(5)] 

= =A(n+1)(N - (n 1) 0 – (n D) (a D) 

g [/(а++в)—у(а)— V (b4- т) + 


(11) 
Where pe ор 
y(x) = T og T(x). 


(9) and (10) are satisfied uniquely by 
Би e 
on = N(a-b-N-1) '" Х(азь-Х-1) 


which do not satisfy (11). Thus this compound binomial cannot arise from a process of this 


Simple birth and death type. 
3. THE MORE GENERAL EQUATIONS 


When n is not necessarily bounded by V and о, and Ên are general functions of time, the 


e . 
quations of the process are 


23 а Виа (alt) + Bal) Pat Baa Pray 
М (12) 
LS L а) Boll) Po 


The first n+1 of these add to give 
а ou Prr—BuPa (= Ол), 


OOF ор 
220 " n+l 
n 
Thus if it is possible to find a simple closed expression for x P., we can use this equation to 
= 


fing values for а, and Ву Otherwise these values can be found with greater difficulty 
i n 
rectly from (12). 
`1. The negative binomial di: 


bem) 9^. wool). 
PO Beat рана "900 


stribution has the form 
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if ос’ = dc dt, 
dP, Г(Е+т) orto’ T(k+m+1) о” = луг тај 
dt Г) ®-1! +0)" nl ое 


say, and n 


dP. (пк) а" kin с ) 
> =—Р шыт; В, * 


The required equation in n is thus, for n = 0, 1, ..., 


=g” _ Zna 0 Bn (13) 


Е ' " п. 
Suppose Ё is constant and the distribution is generated from P,(0) = 1, i.e. с(0) = 0. The 
150 ВИО n+k 


The solution for с when а, and f), are given positive continuous functions is, as ше, | 
20. The simplest solution is Kendall’s (1948a), ал = a(n +1) and ff, = (п + k), where ; | 
and / are positive constants. But it is also true that negative binomial will result when 


and В vary with time. Generally 

t t t 

a =exp(-[ «-/да) | Bexo (| (2—A)at) dt. ‘ 

0 0 0 

Polynomial solutions of higher degrees for а, and f, will be more particular solutions 10 
the sense that the coefficients of n,n*,..., cannot be assigned arbitrarily in both &n 
and f, 


3-2. For the positive binomial with probability parameter p and index N, the equation 
corresponding to (13) is 
dp __ољар , By(l—p) 14) 
eet? усы (neus Ag. ( 


A solution а, = o(t) (n+ 1), В» = P(t) (N —n) is then possible for positive functions alt) 
and f(t). For instance, if n describes the number of regions out of № occupied by а species 
of plants, it is reasonable to suppose that the rate of new coloniza 
each seeding time, when the rate of loss of colonie 


and in anti-phase, a situation typified for examp 


tion is at its highest d 
sis at its lowest, i.e, a, and f, are регіос! 
le by 
а,(4) = (3o) (1 sin ot) m, 
Palt) = (40) (1 эйе | 
For these values, p = }{1 + (1//2) sin (wt— 
length of о, and f,, corresponds to p(0) 
The Poisson equation similar to (13) and (14) is, for a mean m 


(15) 


17), having a time lag on f, (t) of half the ph i 


dm — uuum 


di n+l Fi 


which again is satisfied by arbitrary positive functions a(t) and A(t) in а, = a(t)” and 
В, = 66). а 

3:3. The geometric distribution is the case % 
ability rate 2, = A(t) (n+ 1) is not that usually met ; У Y 
Ên = f (t) n. The difference arises from the ais he бае ойы, аа 


ferent i 
P,(0) = 1, and considered a formula for P, which startin 


b- 
= 1 of (13), but the required birth РГО. 
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a birth-rate proportional to the population size will not move a process away from F,(0) = 1. 
If, however, Р (0) = 1, а solution for the birth- and death-rates follows from the supposition 
that P does not necessarily agree with the formula for Р, with n = 0. 

Then 


1—P,(t) озі E 


P(t) mie К (190) (n = алыса 


The equations obtained from addition of the process equations are 


2% (1+0) -пс' = 01417 (1+0) (в = 1,2,...), (16) 
= Бо 
and the particular equation арр 1-Р ор 

Uu ITA gun, пт) 


The simplest solutions are those given by Kendall (19480), viz. а, = a(t) n and д, = f(t)n. 
For these, (16) and (17) reduce to 


с' = — (a-f) +ê 
Ра 
ала Ў ra. 140, (18) 
the solutions of which are valid under the initial conditions Р,(0) = 0, с(0) = 0. 
3-4. The logarithmic distribution for the initial conditions P,(0) = 1, 0(0) = 0, is given by 
n 
ped E [sim б). 


P0 = ogl + ojn +o)" 
We require 


то" а 1-Р эи Ona WO — (a, +6, табеті eo) 19 

ов ЕЕ (nt1)(i+o) O (п-1)6 (19) 

together with two special equations for P, and Py. Suppose polynomial solutions are sought 

fora, and р. Then (19) when multiplied by n— 1 becomes a polynomial identity in n. Thus 

n n dee es 
%n41 is divisible by n + 1, and f, by — 1. Whence, for = 0, 
1 —Py(t) = constant х log (1+0). 
Таз А р _ d о(0) = 0; thus polynomial solutions are not possible. 
th P,(0) = 1, and ; VP К Е 
But et pm „ја 0, such solutions exist. An artificial one is the disturbed 
0) = where ж and £ are constants with > f; and 


Beometric - By = fn (nz 1), 
rocess, 0, = # Pn 5 . "m М 
ois a ane immigration term to maintain the population if it should drop to zero, viz. 


1 
в - Же [| with k< =. 
| ЕТЕР, pane) ДЕР 
ko” 
Then Pl) = па кој" Pe eh 


13 1—e-«-t 
қ-на [1 Sgen wee e еті 


When k+0, /%->0, and Р) > 1, but 
B, NE sc for nzl. 
i-B, о (1+0) "(1 t?) 
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4. A STOCHASTIC DEVELOPMENT OF NEW SPECIES 


In the previous paragraphs the discussion has been mainly of the growth ofa single «тил 
In this paragraph a simple model for the evolution of a number of different species is co 
sidered as a possible source of heterogeneity, Yule ( 
into genera when new species arise by mutation. 
species in a genus is governed by a geometric birth 
genus contains % species after a time of establishm mE 
stant A. He remarks that one might also suppose that the probability rate of speciation 18 
dependent on the population sizes of the various species, but gives no guide on the form this 
dependence might take. One might, as a first approximation, suppose that the larger the 
population of a species, the greater its chance of producing a member which might be 
classifiable as a member of a new species. We shall assume that each species develops from 


its original member with birth and death probability rates proportional to the population 
size % of the species. The extra assumption is that there is 


1924) discussed the groupings of species 
He assumed that the total number of 
process, so that the probability that à 
ent # is eM (1 —е—Муп-1 for some con- 


d at some past time when the total population e 
lity of speciation rate to size, and the fixing of # 


а rough approximation 


tors were taken into ace 
other factors involved, such аз the struggle for 


quality, the constant Ё must be regarded as summing up the mutation rates and the environ 
mental variability. 


one member. Then, if 


^, members after time t, the probability of a furth 


ЕУ 5; (L—7,)g,(t— 7.) =k У п(1—т,) say 
ico с ` 


Suppen e "%;-т)- ero, эт) with То = 0, 
P(0,7) = exp ( ШІ 
0 


and 
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Then it can be shown that P,,,,(t), the probability that there are N + 1 species at time 2, is 
Я N 

т р(0,ї)р(0,4— ту)... р(0,#—ту) II а (то, -.., T;) dr, ... dr, 

~ N j= 

= | a Ty, t)dT,...dTy, say, (20) 


ты Ry is the region given by the inequalities 0 < т, < Ta, 0<7,<7;....,0<ту<і. Ру.1(0) 
пеп follows a birth process with transition rate Ву (6) given by | 

Pysalt) Py) = үр P( tq, о Tys t) була (То +++ Ту» 0) ту... Ту. (21) 

. E 

E is hardly possible to use these results except for the simplest form of geometric distribu- 
ав It would be most satisfactory to deduce а multivariate distribution of the different 
Numbers of species which at time ¢ have 0,1,... members. Failing this we can find the 
ving a given population. Since № follows a birth process, we 


aver; у 
erage number of species ha 
of species at time f, 


m " е ы 

ау write the equation for the mean number, N, 
dN 

— = Убу Py. 


5 
T | : 3 . 
hen the equation for the expected number М, of species with n members is 


E 
N= | Spy Pygult—7) 4+ galt) 
0 


ғау 
- [= g,(t—7)dr  g.(). (23) 
When this is multiplied by n and summed, our initial assumption gives 
dN ‘ам. = 
—=k C m(t—T)dr 4- mnt) |. 23 
dt p o dT ( (23) 
This has the unique solution for the Laplace transforms ¢ of 7, and у of dN ја 
ko 
=e 54 
y — 1-59 (24) 


à 4-9. The simplest example occurs when both birth- and death-rates A and д are equal 
° А. Then At 
т”-1 COT m i == 
Ialt) = rape (n = 1,2...) got) = 14 AC with g= 

This is perhaps the most realistic of the geometric laws, since it corresponds to a group of 
рев 80 near. оп the average, to equilibrium with their environment, that a new species 
^." difficulty in establishing itself. R. A. Fisher (1930) holds that it is reasonable to suppose 
Jat most, mutant genes that are likely to assist in the рони југа gta specieswill probably 

'SAppear from the population in time. For instance, a gene giving а 1 % advantage 

Wil die out with probability 0-98, while one giving no advantage will die out with 
Probability 1. It ^ основ; hat most mutants that can survive make 


lit ly considered th 
Че effective change in ап organism's 


b rela 
в 5 ә > 
ti Cnt artificial situation in whicl 


1 a new species is 
® it must be assumed thatitisnot much better adapted than the others. If, as postulated, 
Speo; s 
Pecies grows geometrically, it 


s chance of possessing no members after time #— /A if 

> P n Я 

9% бе; and to 1 otherwise. Amongst such laws of grow h then, the one most representative 
àt happens is that for which A 


tionship to its surroundings. Thus in the 
supposed to be created at a definite 


= pH. Then z(!) = 1, and Ру(?) = e-kt(] — e-™#)N—-1 as in 
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Yule’s case, № = ез, When the distribution (22) is applied to actual data, the values of kt 
and At required are such that the distribution is very near its limiting form 


С М, со се“ ту у? % 
= 24 „ы а == 1202522; 
Pn [юу o (14g) y e | (25) 
(22) ay 
Фо = 1-| ТЕ dy, where а = [/А. | 
0 


The p, depend only on the ratio of k to А, since these both have the dimension 1/time. The 
following properties hold: 
(1) Pn > Pny for alla>0, andn>1. 

(ii) py may range from 0 as о со, to 1 авш->0. As «>o, р > 1. 

(iii) The rth factorial moment of (25) is r!(r— 1)!/æ"-1. Hence for small k, despite the 
fact that py(x)— 1, the moments become unbounded. This is implied in the increasing 
average age, 1/c, of a species, since, as a species grows older, g, still increases if n — 1 > 2Ax 
its age, i.e. the larger contributions to the moments increase. When a is large, the ordinary 
moments all— 1. The overwhelming majority of species are only recently born, and have 
in virtue of the initial condition for each species only about one member each. ) 

4:3. Variation ina. One of the chief defects of the model is the assumption that all species 
have the same Ё and А. In (24) k can be looked on as a mean value, if it is assumed to be 
independent of time, since it occurs linearly. A does not occur till (25). In (25) then, it 18 
possible to allow æ = k/A to have its own distribution. Because of (i) the resulting distribu- 
tion of n will have the same general shape as (25) 

For (25) there is the recurrence relationship 


(0+1) Pia = ор, talpi +... +p) фазе (а) (в = 1,2, ...), (26) 
where -ЕЦ-а)- І —de (British Association Tables, vol. 1) 
‚ ава Pı = a(1+ae* ЕЦ —ш)) = a(1—p,). 


The terms extra to np on the right-hand side are of order a+ e log for small a. The mean 
value of æ will be small, since an ordinary birth is much m і 


re ore common than a specie" 
generating birth. Hence one expects that Вера will vary only a little from a logarith™ 
form, constant ха” [т for small 1 — 


r sn v. But a small æ implies by (iii) а distribution wit 
very large moments. A distribution that leaves all moments of E, р, finite does not lea 
easily calculable probabilities. The most obvious dP (a) is dF (a) ie (e"|T(m)) een a" © 


Then [m + 1] moments are finite. When c is large, as is requi | 
Ka , ired ta, the 
probabilities may most easily be calculated from 2 дады 


(m+1) 
m ee овоа) V2) ра). 
4 41)... @+7)(т+2)... (me r4 1) 
rir lie (log c— (n +1) 
— (mr 2) 3 (r1) - (ra 2) +... у “i 
_ m(m+2) с ута 9 
1-%- аан 1) 10886. 1 Съни 


ti (m+1)...( 
“Пи ) ис а-ы! MS те)... |, 


Pn = 


new 
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(27) can be fitted from the first two moments. Thus if и is the observed mean, and v is the 
ratio of the observed second factorial moment and the mean, the equations are, since the 


Zero class is not observable, әс 
(28) 


т=1 
апа 1 = 1-р = —%e* Bi(—«). (29) 


(28) gives c in terms of m; there will only be a solution for m in (29) if 


2 > 2 e ( — Ei( — 2[v)). 

u v 

However, in the case tried, namely C. B. Williams’s data on different species of Macro- 
lepidoptera caught in light traps over 4 years (Fisher, Corbet & Williams, 1943), a solution 
is not possible. The nearest approach is when ¢ and mco in the limiting ratio 2/v, i.e. 
When the distribution is back to the form (25). Then a better fit is obtained from the same 
distribution with the value of а given by the equation l/u = —ae* ЕЦ — а). 

Generally, for any distribution of а, it is to be expected that the maximum-likelihood 
fitting of E, p, will be very difficult. 1f, then, one were to fit by moments, including the 
Second, the equation Ё„(2/®) = v would be used. Since Z(2/2) > 2/H(«), the smallest mean 
9f (о) would be 2/v, which is reached only when Ф (а) tends to the form Р(х Фо) = 0, 
Ба) = 1. For the sort of distribution dF (a) we would be likely to consider, this would 
mean that the variance of а would — 0 as E(a) > 1[Е(1[®) = 2/». The data, of typical ‘log’ 
Shape, are fairly well fitted by (25) with a = 0-00290 and 2/v = 0-00326. Thus it seems likely 
that a ар (æ) leading to a better fit will have to have a mean near 0-00290 and so tending to 
0-00326 from above. Thus it will have to have a low variance and there will be little point 
Mm using something like (27) rather than (25). An improvement might be found in allowing 
A to be different from in the geometric law gan(t). 

4-4. The average distribution with unequal A and 
Of the type (22) has infinite moments of all but а 
than (25) as a description of data of this kind. 

When к> A the distribution is 

"n d 


MTS (i-i 
E je Jt 


u. When А> the limiting distribution 
few orders, and is thus less satisfactory 


dx, 
n 0 
the average distribution only makes sense if a large 


in whi 0, as 
ich &—(~—A) must be >©, the average number of species born up to time t is 


Number of species has been born (У 
Proportional to exp (0— (/t— ^) ?- 
ғ = k[(u А), p, шау be written as 
(£— 1) at. Г(п) pint 1,6; at&AB) (n= 1,2,...). 
(а(а- 9) V Г). 


Ар nearly 1, = }/(ї + 1) say, the easiest form for c 


ESL af-1(1— 0102) de 

+10 

_ @—1)0@ ЕК tog q+) 
D 


alculation for p, is, when 7 is large, 


1— = 


© = T = 00 = 1 
(Е jee yee ee ) (зо) 


r=0 


380 | Population differences between species 


Then later p,,’s can be calculated from the recurrence relationship 


в > ae 5-1. 25-1 ЈЕ = TB, ess] 
(n 1) p, =n Patry n Ра) pel |27 +129): (= 1,2,...) 


(31) 


The chief difference between (31) and (27) is that in (31) the probabilities are exhibited as 
corrections to a logarithmic term, whereas in (27) they are given as corrections to a harmonic 
term. Both correction terms are small. The moments of the distribution determined by (31) 
are all finite. One may find £ and 7 from the equations 


(32) 
1 28-1 1 
o Тар aa) T 


The values of the right-hand side range from log (1+ v)/v when £ = 1, down to 
(2/v) exp (2/v) ( — Ец — 2]v)) 


as +00 (when 7 is substituted in terms of £). For these data Е = 7.3, 1 = 2549-0. The first 
25 frequencies and groupings up to an beyond 50 are compared with those for the log 
distribution. They agree closely after the first. The figures for the æ distribution are so close 
to those for the 2, 7 distribution as not to be worth recording separately. When о is small; 
as here (а = 0-00290), it is approximately 1 —z when the log distribution is given by 
N, = const. «"/n. The value of 7 shows that if this model is at all relevant, the value of An 
is almost indistinguishable from 1. The interesting suggestion from the ratio E/n is that ong 
species-generating birth occurs about every 350 generations. 


Macrolepidoptera figures for all four years 


— „ж 
М, 
" N, 2 i 
Ж N, £3 n log А N, he log T 
obs. DW. ЕЦ istribution 1 istributio 

distribut: ^ — distribu = 
stribution |. 0-99742 obs distribution iam 0-98 

| к кыз » у m 2 aed 
1 35 44-09 40-14 17 cl 
2 11 21-80 20-03 18 В 255 214 
3 15 14-38 13-32 19 3 ie 202 
4 14 10-54 9-96 20 4 n 1:92 
5 10 8-36 7-95 21 1 1:77 в. 
|u| р el ук 
8 6 5-09 i 23 3 1-60 ed 
9 4 4-49 491 Pe 1 Mis re 
; 25 й 10 
n | è| ge | ÁÀBo oiii s 
12 2 3-29 3.25 и : red ES 
13 5 3-02 3-00 aes 2 er 1% 
а | 2 2-78 TA 41-45 5 3-84 419 
14 5 ~ (46-50 3 3.38 зі 

16 3 2-40 35 |e во oe | Сон 

Total Jr 240 М3 
| 


| 
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4-5. Use of the model fo ite distributi si 

ға finite distribution. $ 5 1 
E бин lfi | finite distribution. Suppose as in (1) that species of plants are 
р: g over M regions. Then the spread may possibly be described by some simple 


birt : 
В and death process (z,.//,), with 2, and В, not depending on time; we might, for 
? Bv. 


insta РТЫ Egi 
ance, consider the logistic process %, = ита, By = An(M — n), where n is the number of 


regi ; У 
gions occupied by a species. If one supposes that the mean number ЈУ of species increases 
a 


ex ; қ 
xponentially as in 4-2 and 4-4, the average rate of production of new species is 
Es Py(t) = aet, 
age distribution of the N = et species >a e du аз t-> 


age x occupies n regions is 0, (2) when g,(0) = 1. 
then be obtained directly from the 


PE а is a constant. Then the 
p. the probability that a species of 
ч ne limiting distribution p, that we have been using can 
quations of the process for the (, (2). Thus 
с Р © 
а! e-a Ри) dt = а? e- P (t) dt — a, P,(0) 
0 dt 0 
© © 
алај еа (ata) а |" ena 
0 


Јо 
Т) e P, di (N2n2»1) (33) 
© 
Ог, аз af et P dt = ps 
ар, = Сабин 7 (о + Вы) Pn + Bn—1Pn—» (34) 
Wi |. 
ith the special end equations, 
—а+ару = %2P2—(% + By) Pi + Вор 


apo = одр — PoPo 
a 
ч apy = —%Pxt Вх-а х-і- 
such as (26) or (31), 


nm = 1,2,...). (35) 


Th "T 
© equations add to give à recurrence relationship 


+а(Ро + --: +Pr) -4 ( 
7% When Dy can be found, as is the case in 4-2, (35) can be used to give the p,'s. For n < M, p, 
8 of the form A y+ By» the coefficients Ал and В„ being obtained from (35). Then p,, can 


° found by addition of the probabilities. 125 | 
9 show the possibility of such ‘explanation’ of the distribution of species, we can fit 


Чада, (kindly supplied by Prof. A. В. Gemmell) on the spread of moss species over the six 
Slang of Hawaii. The process Фа = 0: E An(M —n) has been used. This is not quite 
“realistic ан, species is only observed when it has a large number of plants and so is not 


80 li " 
likely to disappear. However, the fits are good: 


ба Ва = Bn Pn 


even with a, = 0, 


| = _2)(M—(n—2)) 1M -1) y 
p, = @—1)0И-(@-—1)) (% 2) „А-В - 
y n(M —) y4(-1) M-n- 1) у+2(М 2)y 4- 10M — 1) - 
X у = ША. 
25 


1 = ——. 
у-ЦИ-1) 


lometrika Xo 
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; i Бе! ious 
The figures are, for the three different reproductive types of species, dioecious, monoeci ; 
and sterile: 


Moss species in Hawaii 


Dioecious Monoecious Sterile 
n Obs. Exp. n Obs. Exp. | n Obs. | Exp. 
| —— 
ы þa 
1 12 11-19 1 15 13-88 1 22 | 2128 
2 6 5:70 2 6 5-64 2 7 Tl 
3 4 4-22 3 3 3:39 3 5 3-56 
4 3 3:87 4 2 2-48 4 1 9-11 
5 4 4-53 5 0 2-13 5 0 1:39 
6 13 19-49 6 4 9-47 6 2 0-97 
у= 1-816 y = 4:308 у = 7:156 


y was calculated in each case from the maximum-likelihood equation 


1 1 "m 
Pu i tr) E PROP 


4-6. Remarks. The simple assumption of proportionality of speciation rate to the ыш 
population thus leads to distributions substantially like those observed. Yule’s data for ta 
increasing number of species in a genus bears out the e =" 
for N. For such an N the form of the Pn stil 
growth laws for the different species may 
a logarithmic form. For instance, rand 


be the consideration of competitive population: 


law, but they are possibly best considered in relation to the shifting of the populat! 
inside the living region. 
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‘MODIFICATIONS TO THE VARIATE-DIFFERENCE METHOD 


By M. H. QUENOUILLE 
Institute of Statistics, University of Oxford 


1. INTRODUCTION 


1-1. Methods of trend elimination in time-series analysis fall into two main groups. First, 
where the form of trend is known or may be assumed to be polynomial, it is possible to fit 
polynomials and to use residuals from this fitting in further analysis. This method has been 
described elsewhere (Quenouille, 1952). 

A second, more exploratory, method which assumes nothing about the form of the trend 
makes use of moving average difference formulae for its estimation. This method is, however, 
Subject to the criticisms that the form and extent of moving averages chosen would seem 
to be somewhat arbitrary, and that the elimination process may easily give rise to misleading 
Tesults (Spencer-Smith, 1947; Quenouille, 1949). 

1-2, The variate-difference method (Anderson, 1914) is closely related to the method of 
moving averages, and may be regarded as having greater generality. Any moving average 
difference formula may be shown to be equivalent to an average of higher order differences 
and, consequently, results obtainable from such formulae may be obtained by appropriate 
Manipulation of the variate-differences. For example, if V?z; is used to denote the pth 


vari i i 
variate difference, i.e. 
р\ DA s 
Vos, ин“ 1 Жар + 2 Яр. 


then a simple five-term moving average difference formula may be written 
Bd, = — а Яна + Фа, — S21 — Vi-2 
= Vig, — 8V*2, 4 7 Vie 
Results obtainable using the residuals from such a formula are therefore derivable from the 
Variate-differences. 
153. The classical variate-difference method has therefore a greater flexibility than the 
latter may be judged from the variate- 


Use of moving averages, since the effect of the lati ed · 
difference. Further, it has the advantage that criteria and tests of significance can be set 


v : 
Тр to judge the extent of the trend. | | ж | 
A major disadvantage still exists in that the existence of serial correlation in the residual 
clement of the series will bias the results and produce misleading conclusions about the form 
Of the series, Also, it is not clear that the tests used when serial correlation is not present in 
е residual element are the best possible. 


l-4. Th i t to do two things. The 
. The following paper sets ou 
“Tving Sesia 21 tests of the existence of trend in variate-differences of any order, 


а; + s 1 АЈ тәті f 5 4 
a with finding more accurate estimates of variances m соу c ito mites 
ement from any set of variate-differences assumed to be trend tree. e latter part, 


55 7—10, is concerned with how this theory should be modified if serial correlation is assumed 
9 exist in the mei ]s. As such i ге a strict criterion for the separation of trend 
Sidua4s. + 


+ does not giv : : 
Nd serial correlation, but ib does provide а basis on which the separation of the two may 
ake place, | 
25-2 


first part, $$ 2—6, is concerned with 
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For example, the analysis of $7-3 indicates the existence of both trend and un 5! 
tion in Kendall's sheep series. It shows further that trend is unlikely to be wo ^ pal: 
third and higher variate-differences. The analysis of §9-5 shows further that, if it не eee 
that there is no trend in the third and higher variate-differences, аб вва : pos variance 
autoregressive process is needed to represent the series and that, if such is пне ~ А иез 
of the trend in the second variate-differences is unlikely to be more than 0-4. The gor : жей 
is therefore that a second-order autoregressive scheme with a trend, small in its se ille 
differences, is a good representation of the series. As emphasized elsewhere e apap hile 

1949), this is not, however, the only possible representation, and it would still be poss si 
to maintain that the series consisted of a complicated trend with a random pem "d 
possibly, of a high-order autoregressive scheme with no trend. The process to be outlin 
gives rise to the simplest possible representation for any time series. 


2. GENERAL INVESTIGATION 
А 5 8 ; Р aking 
2-1. The classical variate-difference method is based upon the idea that by t ae 
alien ; : , serve 
successive differences of a series of observations, the ‘smooth’ part or trend of the obs ee 
tions will be eliminated, and what remains can be used to find out about the rat 


3 8 м у 5 | 5 tren 
element. In particular, if the series 23,25, ..., у p is free from serial correlation and no 
component exists in the pth variate-difference, Ура, then 


estimates the variance, 02, of the random element. 
Any average of a series of such terms also esti mates т? 


А has 
‚апі, commonly, estimates 810 
N 
(Уға)? 
іші 
uH 
А p 
are used for this purpose. opté 
20110 
For reasons to be seen later, we shall use a modified definition which applies end-cor? echt 
to the averaging process. 


is wi ing 
This will not affect the large sample theory, but it will have the joint effects of mak! 


it more applicable to small samples and of simplifying conversion formulae. 
2-2. We sh 


all define a series of summation functions Хо by the formulae: 


Lory; = 2171 Һау +... TU y. 
х0; = 1% 

@ = 22191 + 29, +... уу», 
Жахи: = феи + 393, + 


3 r 
Р ~ ЫБ т, уу, 
the summation in each cage coverin; 


‘on 
g all values up ф . ormat! 
determined by the recurrence relation P to 2,, Yn. The general law of f 


18 
Y n—1i 

2,9. = 
ae iYi РУСЕ Қау 1), 


so that, for example, the first three coefficients in У ar 
(3) are 


$, запа 5. 
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2-3. For : а А 
2-3. Когапу B ^w 9 7 1 
any series, t4, x, ..., Ëm the estimated variances, „јр „й: ma ..., Will be defined by 


У, 2 
х= 20%) 
m^ N 
У түг 
Ў = Yo aX Vt) 
оса 2N "y 
21 


Done Vex)? 


iil у x(?) 
p 


ат. However, m may be arbitrarily chosen so that it 
al to or greater than the largest value of p required. 
1as equal to the largest value of p occurring. This 
al, be omitted unless the line of argument 


Under this definition, p cannot excee 
сап always be arranged that it is equ 
| In the following sections, m will be taker 
18 usually ten. Further, the prefix m will, in gener: 
Tequires its inclusion. 

2-4. Serial correlation coefficients may be simil 


arly defined, so that 
= “т 
p = nc pin 
wg 2 


Again, m must be chosen to equal or exceed p, and, with this condition, the suffix m may 


Usually be omitted. 


2.5. Tt is then not very difficult to derive the following identities (see Kendall, 1946, p. 23): 


2p 2p(p-1l) , 2p(p-1)(p-3) , | 
рт а- сй 
„-%[1-рут"*{р+1др+®)'* @+00+8 0+9" 


qaos —] p2(p2—1 2—4 
у, = %-2% + or [Жз ш = шя 


and 3 
труд (3!)? 
approaches employing variate- 


sic equivalence of the 
mple way in which to find the 


ae formulae demonstrate the ba 
ifferences and serial correlations. They also provide a 81 
Variances of combinations of the У; under different conditions. For instance, 
204 2p? 2p*(p—1)° | 
=== 1 5+ 2 mte 
var}, = у [ рта (0+1)%(0р+ 2) 


| for a random series. 
| 2-6. When a set of variate-differences have been calculated as described above, it is 
stimate of the residual or error variance in any series that 
tes. If it is supposed that Venza +++ Гоно ore free from any 
to find coefficients ¢; (i = т+1...т-+р) во that 


r 

алыр to ask what is the best е 

| t ay be obtained using these estima 
"end component, then the problem 1$ 


| к ы” eV, 1 llest possible variance 
а = l and so that X e has the smallest pos а 3 
i=m+ 3 
nimum. If we note that 


ved by finding а restricted mi 


4. 
a 0б 4+) / 
iJ MV 


ts, and will normally be assumed to be 


This problem may be sol 


У 


Wher > 
[ here ky is the fourth cumulant of the residual elemen 
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map =. " 
cat i һе variance of У cV, the coefficients c; are giv 
zero, and (к. +Ао“)[М№ is supposed to be the variance T" i 
by the symmetrical equations 


mp 

E: M LN = | 

bp » (јет+1,..т+р). 
i=m+1 

Solution of these equations gives rise to the values for с; and А. 


2-7. The following particular solutions exist for these equations 


p=] p=2 
m 
Ован А Ста Ста 

0 1 3-00 2-50 — 1:50 2:50 

1 1 3-89 3-86 — 2-86 3:00 

2 1 4-62 5:20 — 4:20 3:43 

3 1 5.25 6-54 — 5:54 3-82 

4 1 5:82 7:88 — 6-88 4:17 

5 i 6:33 9-21 -8:21 4:49 

6 1 6:81 10-55 — 9:55 4:79 

p=3 р-4 
т 
Ста Cm42 ©т+з А Стал Стља а 
— 

0 4-67 | — 7-00 3°33 | 2-33 7-50 | — 20-25 2-25 
1 | 10:36 | — 17-91 8:55 | 2-69 22:71 | = 67-28 252 
2 | 18:18 | — 33-04 15-86 | 3-00 50-37 | — 152-56 277 
3 | 28:14 | — 52-43 | 25-29 3:28 | 94-33 | — 287-71 gon 
4| 40:23 | — 76-09 | 36-86 3-55 | 159-00 | — 486-23 en 
5 | 5445 | —104-01 | 50-56 3:79 | 245-2 - 7496 sal 
6 | 70-79 | — 136-17 | 66-38 4:02 | 356-6 — 1088-9 aron 


Я ь sing 
l improvements in М2 


accuracy can be achieved bY jnin£ 


- For instance, the yay; y comb 
V, and V, the variance is reduced to е се aen at ni + 
ейпсев 

2-8. It should be noted that the e do not differ appreciably from those F. 4 
would be used if the a, теста, di» 
2 2 3 5 spo” 7 

to i = — 1 was required. For instance, in th; op : јен Ше сач i ж р? 

Jj = 2, we should have ¢n+ 


с 

till ‚ ср but for Pr хиб 
Tougher approximation, derived bY 

= 0, using the formula, 


We mm + 1 
97 b, ур 
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ТЕ this is used, the following coefficients are obtained: 


! 
Ф=1 p=2 p=3 р= 4 
m 
Cm41 À дыз биа | A Сты | Ст+2 | Стаза А Cm+1 | Ста | Cmts | ©т+ А 
TM | 
0 1 3:00 | 2 —1 | 2-56 3 -3 1 2-42 4 —6 4 -1 2-85 
1 1 |3801 3 | -2| 3:08 5 | -6 2 | 2-89 8 | -15] 11 —3 | 2-75 
2| 1 |46 | 4 | -3| 353 | 10 |-15 6 |317| 20 | —45| 36 | —10 | 2-98 
3| 1 | 595] 5 | —4| 393] 15 |-24 | 10 |348| 35 | -84| 70 | —20 | 3:24 
4| 1 | 582 |6 |—5| 429 | 21 |-35 | 15 3-76 | 56 |-140 | 120 | —35 | 3-48 
5| 1 | 633 | 7 | —6| 4562 | 28 |-48 | 21 |403| 84 |-216 | 189 | —56 | 3-71 
6| 1 [6811 з | —7 | 403 | 36 |-63| 28 | 428 | 120 | 315 | 280 | -84 | 3-92 


2-9. This table shows that an appreciable increase in the efficiency of estimation of the 
residual variance is obtained even with simple extrapolation. The efficiency of this extra- 
polation method as compared with the most efficient linear combinations, obtained in § 2-7, 


18 as follows: 


m p=l p-2 p=3 p=4 
0 100% 98% 96% 96% 
1 100% 97% 93% 929%, 
2 100% 97% 95% 93% 
3 100% 97% 94% 93% 
4 100% 97% 94% 92% 
5 100% 97% 94% 92% 
6 100% 97% 94% 92% 


both m and p. However, іп view of the computational 
be given later, of the extrapolation method, its use is 
in efficiency from its use can be compensated by 


The loss in efficiency increases with 
Convenience and other properties, to 
Benerally to be advocated. Any loss 
employing more values of И. 

‚ 210. If it may be presumed that И 1... Тир are free from trend, the trend component 
In V, may be estimated by comparing it with Ролл ++» Гана А further question is then posed: 
Which estimate of the trend component in J, is the most accurate? To answer this, we have 


m+p 
,,m 4 p), subject to У 4, = 1, such that the variance, 


i=mt1 


to find coefficients d;(¢ = т + b - 


"у minimum. These coefficients may be shown to satisfy the 


dV, isa 
+1 


(ка + озу, of V, — 
i=m 
equations Р 


тер 
> (dji aa а) 4; —А= Фа” Amm 
i=m+1 


e "уе ГА =-1. 


i=m+1 


(jo т+1,..„т+р) 


Solution of these equations gives rise to values for d; and А. 
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9-11. The following particular solutions exist for these equations: 


р=1 p=2 БЕЗУ 
т | л 

[^ A [m [^ A dm Ima Фила | 

| . — 

1 1 | 0.222222 | 2.327 | —1-327 | 0-030612 | 4-081 | —5-388 2-306 PLE 
2 1 | 0-108889 | 2-230 | —1-230 | 0-006949 | 3-763 | — 4-636 1-873 фр inte 
3 1 | 0-067347 | 2-177 | —1-177 | 0-002469 | 3: 1:652 о 
4 1 | 0:046838 | 2-143 | —1-143 | 0-001113 | 3-475 1:517 | 0-0 (029 
5 1 | 0-034974 | 2121-1121 | 0-000582 | 3-402 1:432 | 0:00 CD 
6 1 | 0.027391 | 2-104 | —1-104 | 0-000337 | 3-345 1:368 | 0-000 

ul 


== КЕ 
, ; à ЈЕ), 
It may be seen that these coefficients tend to binomial coefficients, i.e. d,,,;=(—1)! (") 


and this may be proved mathematically without much difficulty. (In the limit, - 
coefficients a;,— т May be accurately represented as polynomials in j. It follows t^: 


a limiting solution is given by the differencing process, дам = (— 1) a and A = 0) 


Estimates of trend components are thus given by variate-differences of the и. 


а ТЕ " Е Я — 7 
2-12. The function >; (— дны, LA V; thus estimates the trend in Vans if Yngi Puer 
t=m 


are free from trend. Its variance is given by 


(9) (5) чизе, 
act У (дун VIN ?т+ї+) ) Ло 


N ve 2m-2) ^ N° 
TA d m 4j ) 
This is of order mi-?», The following table gives the values of А: | 
| 
Values of p 
"m | 
0 1 2 3 4 а 
мне ол Ше” 
1 3-0000 0-2222 0-042229 ne | 
3: | 2 0-01591 ? “008 
3 ош | 01089 | 0-010522 0-002394 Ма 0.000330 
4 52531 | 09613 0-003935 | 0.000599 0-000142 0-000007 
5 5-8187 00886 0-001833 | 0.000200 0-000035 0-000007 
| 56 6-3346 ooz | eee поо 0 ра са" 
7 68118 0-0222 0-000364 preces Шын - 
8 73518 0-0185 0-000242 c: = 
9 7-6781 0-0157 = = у а 
10 8:0766 == e = ho = 
For completeness, 


the variances E: 
of the У, have been included in this table in the column 7^ 
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2-13. The efficiency of estimation of trend components using this approach is given in the 
following table: 


m 1 2 3 

1 100% 73% 49% 
2 100% 66% 39% 
3 100% 63% 34% 
4 100 9; 61% 31% 
5 100% 59% 98% 
6 100%, 58% 27% 


It is again true that the efficiency decreases as m and р are increased. However, the loss in 
efficiency may be easily compensated by including one or two extra values of V, ie. by 


Inereasing p. 


2.14. The two processes of estimating the residual variance and the trend components 
may thus be carried out using differences of the quantities V. However, it must be remarked 
approximate basis for estimation. An alternative, which is 
ledge the fact that the variance of V, increases 
ably chosen scale instead 


that this is not the only possible 
likely to be somewhat better, would acknow 
roughly linearly with /(m) by extrapolating to zero using a suit: 
of эп. 

One such scale appears to be provided by ү(т--1)-1. Ifthisisemployed, with the method 
of divided differences, more efficient estimates can be obtained. The extra computation 
Would, however, appear to be seldom worth while. 


2-15. It might be asked what is the probable bias resulting from the failure to eliminate 


trend completely. 
Suppose, for instance, that extrapolation is с 
Component o? is present in И. The error in the extr: 


m* 


arried out using Vp ... Гар» but that a trend 
apolated estimate, Vo is then 

m(m + 1) ... (m4-p—1) 5 _ (m 4- 1) (+2)... (m--p) o 
01-4 mol. ( p! ei pi et 


However, the variance of o7 is 


р 8) 1 
() j pee _ Act 


рор БЕКІ? 
2.91 ye any ey, N 
( т }\ m+j 


а таша 
Thus the variance of the error m V, is 


(m +1) (m 4 2) ... (m+P) 2104 Хо нај 
[ p! N М” 


Y is of order ,/(m)/p. 
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p 


0-89 0:38 0-25 0:2 
0-98 0:38 0:24 0-1 
1-08 0-39 0-24 0-1 
17 0-41 0-24 0-1 
1-26 0-43 0-25 0-1 
1:34 0-45 09-26 0-1 
1-42 0:47 0:27 
1:50 0:49 -- 
1-57 -- -- 


бр 00-10: 0 iR O2 он 


и ти 


The following table gives a series of values for A’: 


As a consequence of the fact that A’ decreases as p increases, we may say that the bias 
in the estimate of У; decreases with ,/(р). 


3. EXAMPLES 


3-1. This theory may be illustrated using the two series given in the following table. The 
figures are in hundreds: 


4 1 2 4 1 2 i 1 2 { 1 2 
NEC 
1 |1000 | 1018 | 14 | 273 | авт | 27 | та 2 170 
2 | 905 | 927 | 15 | 247 | 465 | 28 | ет | 475 | 44 | 20 | 148 
$ | B9 | 847 | 16 | 228 | 465 | 29 | е | 445 | 45 | 17 | 128 
% | 741 | 776 | 17 | 202 | 468 | зо | 55 | 423 | 42 | 15 | 109 
в | | Te |) | 388 | e$ | mx | о | um | |ы | 
іші $e | 19 | 185 jaam | за | 45 | 5 | de | ua | 
ei | H5 | 25 | 10 | 48 loo | dj | ue | 2 | di | 68 
іі | а | 2 | 5 | dv | м | gy | Sor | uo | un | = 
2129] ae oe | de ж ТЕ |o | e] ~ 
BE ев, | a | ae | у | oe | <- | - |» 
Ш | 368 | 49 | 24 | 100 | d | 37 | 37 | 22 | 49 29 
12 | 333 | 483 | 25 | 91 | авв | зв | 57 | 245 | 50 ШЕ 
1e | МЕ | же | па | ва оа | 3 | = = 


Both of these series consist of a trend 
The trend in series 1 is given by 


Т, = 10° exp (5) G 


plus a random element, 


and that in series 2 by 


Consists of the 


functions from three-place tables, Thus, in series 1 


„= 105 E ees! ; 
т, [exp ( 16 ) to three decimal Places — exp( == a) 
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and, in series 2, 
u; = 10°] ехр | — 1—1) to three decimal places — ex LESE 
| 10 d *P\ 10 


1 (0-26) 
"дат P 200 


: 1 (i — 26)? 
to three decimal pl: р == 
в-расев — Tram) P — 200 | 


The value of ø? is consequently 816-6 for series 1 and 1633-3 for series 2. 


Table 3-1a. Estimates of the trend components in series 1 


Estimates 
V PT 
1st 2nd 3rd 4th 5th 
5 pec 25820232 | 25735398 | 25650200 | 2556536-6 | 25480674 
~ + 699) 
2 9663-8% 8483-4* 8510-8* 8492-4 8468-2 8442-6 
(+795) (442-1) (+ 13-08) 
3 1180-4 974 18-4 24.2* 25-6* 26:0" 
(3:867) (+ 33-1) (+ 8-00) (= 3-12) (+ 1-52) (+ 0-84) 
4 1207-8 —458 —5-3 -14 — 0-4 i04 
(+ 924) ( + 27-6) (+ 5-46) (x 1-80) (+ 0-75) (+ 0-34) 
5 1253-6 40:0 --44 -1:%0 -03. | — 0-3 
(+973) ( + 23-9) (+ 3-99 (+ 1-14) (+0-42) | (x 9-18) 
6 1993-6 = 356 -34 = 07 0-0 = 
(+1015) (+ 21-1) (+ 3-06) (+ 0-78) (+ 0-26) == 
д ^ 1829-2 = 32:2 -27 ==04 - = 
(+ 1053) ( + 19-0) (22-43) (0-56) 38 — 
8 136144 -295 mh - = = 
(+ 1087) (+17:3) (+198) = ЕЕ = 
9 1390-9 —27°5 ән E ји = 
(+ 1118) (+ 16-0) = ES E - 
10 1418-4 == = = = z 
(+ 1146) = a 
* Significant at upper 1% level. 
Table 3-16. Estimates of the trend components in series 2 
ыы 
Estimates 
V 
Ist 2nd 3rd 4th 5th 
T 2136689-82 |2119064-59 2101548-55 |2084135-64 
1 | 2173658-54* de 17695-93 17516:04 17412-91 17311-91 
z 19226-22* | 17142, 109-19* 103-13* 101-00* 100-23% 
Я 1483-72 ur 6-06 2-13 0-77 0-36 
1366-45 3-93 1:36 0-41 0-15 
5 { 9-02 Е © 
6 1358-87 1-91 9.57 0:95 0:26 — 
1356-35 Š 1-62 0-69 = — 
1 1358-26 m 0:93 5% = = 
5 1362-74 = Aa ae i == a 
10 1368-84 = 70 ар. E = = 
1375-87 E^ 
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3-2. Tables 3-1@ and 3-15 give estimates of V, — И, for the two series together with the 
2. 6 s V, (m = 1 to 5). 
s of successive differences УИ, | 
= bracketed figures in table 3-14 give the standard errors of the estimates, calculated 
using с = 816-7. Those for table 3:16 would be ,/(2) = 1-4 times these values. 
о 


3-3. Examination of the estimates for series 1 shows а trend component of about 26 in 


V, and a negligible component in V,. Similarly, for series 2, a trend component of about 
100 exists in V, and a negligible component in V}. 


i ssi з іпа- 
Estimators of У) may therefore be formed using values after V,. Some possible combine 
tions are given in the following table: 


Series 1 Series 2 


УИ, 919-0 1549-1 
= 893-8 1522-6 
PeF 969-4 1484-9 
Ре Ку 969-4 1421-9 
Vis 1053-4 | 1349-9 


| | 


The coefficients of variation for those estimates vary between roughly 
100,/(2-8141) = 26-1 % 
for the first pair and 100 (45141) = 33-1 % 


for the last pair. None of these estimates differs from its theoretic 


" n 
al value by more tha 
its standard error, 


3-4. To set more exact confidence limits, it is necessary to calculate the efficiency factor? 
E, of the estimators, using the values of A given in the table of section 2-8, These are give” 
by E = 1). 

Confidence limits may then be calculated int 
of variance based upon ÆN observations, 

For instance, the efficiency value for the fou 
effective number of observations is 41 x 0-5102 = 21, 9/ limits for сте for series 1 819 
thus 969-4/1-69 = 573-6 and 969-4 x 2:04 = 1977-6, а мани 


; jmate 
he same manner as for an ordinary estim 


rth estimate 18 2/3-92 = 0-5102, so that the 


4. CORRELATION ANALYSIS FOR RANDOM SERIES 
4-1. For any two series it is possible to calculate not only the Уз estimating the residual 


D h 

ies. These C's may be calculated i estimating the residua] covariances betwee? А 
series. ; calculated in an exactly similar тат эв except that €? 
square has to be replaced by a product. "ist ad: PRIN 


CES FN 


е 
Е ‚М +10) ar iti to replae 
in calculating "E ) are used, it is necessary 


us, for instance, we may set пр & serie? 
б = (03) (WY) + (V190) (уш, 
N 


equations 


84... + (Viten) (улоу) 


Q, = У) (Убу) + (уз, (Уб) +... 44 
N 


(Very) (УЗу ла) 


ete, 
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2 " 2 
4-2. A curious analogue exists between the formulae for the covariances and those for the 


variances. The pth covariance may be written in the form 


2p 2p(p— 1) 


@ = a y) -— оозу (2,9) to - 
Cp = соу, (2, y) Spiel а рау ml bius 


Viste cov (2, y) = COV (ауд), 
cov, (2, y) = 21007 (зуда) + COV (ar yl 
cov; (x, y) = &[eov (аца) + COV (Titos 1)]- 


The formula for V, is the particular case of this where 2; = Yi- 


4-3. Now, if the correlation between z and y is p, and no serial or lagged correlation exists 


in the two series, it is not difficult to show that 


202 


би 


var [covo (2,9) = (15-29% 


l+p%o2c2 p 
var [cov; (v. y)] = aie (6 = 2,2, ...). 


cov [cov; (<, у), cov; (v. y) = 9. 

Since the factor (1+?) occurs in all of these expressions, any process which minimizes 
the variance of a combination of the С/в does so irrespective of the value of p. Consequently, 
the processes derived for the J; (i.e. where p = 1) may also be applied to the C;; the residual 
Covariance between 2 and у may be estimated by extrapolation in the same manner as the 
residual variances were estimated. 
lation coefficients may be calculated using variances and 


4-4. For the same reason, corre 
ar fashions. The variances of these will be 


Covariances which have been estimated in simil 
a-e}? 
EN ' 
ation. 
following table, assuming that extrapolations 
d in estimating the correlation coefficient. It 


able of $2-8 into two (the minimum possible 


Where И represents the efficiency of estim 
А The values of this factor are given in the 
involving Va to Vip or Gn +1 to бир аге USE 
15 derived by dividing the values of A in the t 
Value for A). 


Efficiency of estimated correlation coefficient 
Г T Ж 

т 1 2 3 4 

. 0-781 0-826 0-851 
1 ЧИ 0-649 0-692 0-727 
5 0:433 0-567 0-631 0-671 
3 0-381 0-509 0:575 0-617 
^ 0-344 0-466 0-532 0-575 
5 0-316 0-433 0-496 0-539 

ў : 0. 

6 0:294 0-406 0-467 510 


correlation coefficients calculated by the 


distribution of 4 
m the distribution of ordinary correlation 


4-5. Tt seems unlikely that the 
greatly fro 


extrapolation method would differ very 
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coefficients, since, in the limit, the estimates are 100 % efficient. It would раннее. 
be reasonable to test any correlation calculated by the extrapolation p " i ld ed 
an ordinary correlation coefficient based upon En pairs of observations. This should g 

a good approximation to the true state of affairs. 


5. EXAMPLE 


5-1. An example may be provided using series 1 and 2 of the last section. These series have 
a theoretical covariance of 816-7 and a theoretical correlation of 0-707. 
Table 5-1а gives estimates of C, to Ср and their differences. 


Table 5-1а. Estimates of the covariances between series 1 and series 2 


| 
Differences 
p 0, | 
lst 2nd 3rd 4th 5th 
1 | 1561690:09 | — 1551285-16 | 1542029-34 | —1532770.58 1523540-43 | — 1514343-08 
2 10410-93 — 9255-82 9258-76 — 9230-15 9197-43 —91099. 
3 1155-11 2-94 28-61 32-72 33°77 = Је 
4 1158-05 31-55 -411 1:05 — 0-38 о 
5 1189-60 27-44 —3-06 0-07 —0-19 йб 
6 1217-04 24-38 —23.39 0-48 — 0-13 = 
1 1241-42 31-99 — 1-91 0-35 - = 
8 1263-41 20-08 —1-56 = = == 
9 1283-49 18-59 e Es үзі - 
10 1302-01 — " ње E — 
Mv 


This table shows similar characteristics to 


tables 3-1a and 3-15, trend being largely 
eliminated in fourth and higher differences, 
5-2. Estimates of covariances may be obtai 


: Н of 
д : ined in the same manner as estimates | 
variances. For example, using C.-C), we get 


4х5 4х5 
Oy = п-ов, 


= 1158-05 —4 x 31-55 —10х 4.11 " 
= 1:04 — у а 
= 945-8, 04—35 x 0-38 56 x 0-19 


E o Estimated 
ng Covariance 
зе ышы. 
кенес ee 
C.-C, 945-8 
Ср Су 962.2 
р 10 977-3 
Ci 1004-6 
СО 1046-6 
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5:3. і 
4.2. estimates may be used to find the correlation between the two series. Th 
бе г table gives a number of estimates of correlation coefficients together with s 
ive number of pairs of observations on which they are based. s Е 


Estimate using Coefficients No. of pairs 

с, 0-658 27 

6,* 0-764 21 

с, 0-873 18 

с, 0-901 16 

с, 0-912 14 

с, 0-919 13 

с, 0-924 12 

С, 0-928 11 

с; 0-930 11 

би 0-932 10 
C.-C, 0:793 29 
бб 0-811 97 

O,- Oso 0-815 23 
бб 0-856 21 

| Cig 0-878 18 


* Biased by the existence of trend components. 


А. t may be seen that, apart from the first three estimates, which are biased by the trend, 
hs | t obtained using serial estimates of covariance are better than those individually 
tion ny one coefficient by the use of the z transforma- 
Ms - For instance, the value of 2 corresponding to the coefficient 0-793 is 1-080. This has 
h andard error of roughly 1/,/(29) = 0-186, and its 95 % confidence limits are consequently 
oughly 0-715 and 1-445. The corresponding 95 % confidence limits for р are 0-41 and 0-89. 

It is perhaps worth noting that the actual sample correlation between the two series when 
‘heir trend is removed is 0-781. The agreement with this calculated correlation coefficient 


18 very good. 


5:4. Approximate limits may be set for a 


6. THE EFFECT OF SERIAL CORRELATION 


mption that the series used are serially 
d, the accuracy of the estimates of 


& 


al correlation exist 


Modifications in the ordinary procedures need to be adopted. 
ion will be largely concentrated upon the methods of 


ауре: А 1 
ето three questions. These are: 


(1) How may serial correlations in th 
x May any existing serial correlation 
(3 regressive type? 
and М procedures exist 
rial correlation exist? 


e series be detected? 
в be fitted into a scheme of the moving average or 


for testing the correlation between series where both trend 
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А А " а i rtant 
6-2. Before starting on a more serious consideration of these questions, one impo Fal 
prm а м H + ne 
int might be noted: If a large number of variate-differences are used in estimating ie 
om à эү дар cd ; i rig в а 
: riances or covariances, these will give in general unbiased estimates of variances à 
vai 
covariances. 2 
This will be apparent if it is remembered that 


"T 2p 2p(p-1) . | 
VELIE р (р) р)" | 


А қ сас or by the 
and that the estimate of V, obtained from V,,,, to Vaim by either the most exact or by 
extrapolation method, will be of the form 

Vo = WI -a,r — аут аз...) 


, 


where а; > 0 as р = оо. 


f ы — ; Ж? А ,mptotic 
A more rigorous proof is outlined in Appendix 1. It indicates a desirable asympto 
independence of the serial correlation in the series. 


" А "eciable 
6-3. Of course, in practice, we cannot let p become very large, but, even so, an apprecta 
reduction in bias may be achieved by the use of several of the V's (ог C's). 
For example, if И, V, ..., У,, are used, the most exact method gives an estimate 


while the extrapolation method gives an estimate 


, 2 4(p—1) 
Бри | oe — е 
ы iL РТ  (prl)(p43)/2 


Both of these are liable to differ very little from М» 


6(p—1)(p—2) 


wore | 


6-4. To illustrate the actu 


аб 
al sizes of the bias, the following table gives the values ae 
would be obtained for о? by 


the extrapolation methods on different series for which 9* ^ 


These values are reasonably Satisfactory for 


Series 
^| Moving Moving 
Estimate Markoff Auto- Auto- average averag 
using regressive regressive 
р-0-5 @= 1-1, | 4-14, р — — 0.64, 
Ь = 0:5 8-08 ру = 0-5 Ра = 0-30 
V, 0-500 0-9 | ae (а ей 
V; 0-365 о | 0889 0:500 ron 
Vio 0-349 0-058 9-039 0-167 2.352 
T.- Ys 0-796 0. 0-031 0-091 2-648 
2 sal 1179 035 
ТГ 0-668 0-594 Д 0-909 1 6 
Vs-Vio 0-477 0-175 1-071 0-818 1:096 
e 0-201 5 445 
з= 0-386 0-078 2 0-545 1:44 
9-047 0-273 9.040 
— — |i eee 


Ho 
Й-И, and 7-0, but are poor for W^ 
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6-5. The first of the above queries may be reformulated to read: If V, ... Vn+p may be 
assumed to be without any trend component and 4+1 Ра+2 --. may be assumed zero, how 
may we estimate ру, ры... Py! 


Let us consider, first of all, the case 4 = 1. 
One approach would be to express the means and variances of Гл --- Йл+р in terms of 


Pand б, and thence to obtain maximum likelihood, or for rough uses least square, estimates 
of p, and с. These would be quite complicated to obtain and would still provide only a first 
step towards simplifying the serial correlation picture; it would undoubtedly be necessary 
to reconsider these estimates as p, and higher serial correlations were taken into account. 
A simplification of the procedure, to allow more rapid approximate testing to be carried 
out, is therefore desirable. | 

One possible simplification is to ignore in the first instance the лы of the variances 
of the V’s on the p;. It would then be possible to estimate 07, р; 0*,... by least square 


Procedure. 


An equivalent method to this is to estimate руй, from the differences 


7 


Ун x: Vo: Len PY Vis Lt. ино ES hos 
and to use this to correct for the bias in the value estimated for V, on the assumption that 
Pı = 0. This approach is the simplest mathematically and numerically. 
6-6. We now consider a series of functions, Аул, Ps, >>> defined by the formula 
(2+1 (2+2) ту 
у ние то ан VE, 
lt is not hard to show that 


" 9p(p-1) , И ин. | 
№, n[n-5 3^ tprp” (p+3)(p+4)(p+5) * 


80 that if p, = рз =... = 0, we have 


E(Ryy) = 010% 

2g)! 

N 1 (22+290! — 
gi cov (Ry Ва) = (ра) (pq) — 2r! 
toset up equations similar to those of $2-6. These 
À for estimating the first serial covariance 


(p+2)!p! (g 2) 4! Зра + 2p + 291 
941 4pqg+2p+2q+1" 


6.7. Using this latter value, it is possible s 
"Day then be used to give the coefficients c; an 
| i Jue: À: 
and its variance. The following table gives the values of 


? 
т ә 3 4 
1 2 
1:33 1:25 

1:50 4 
Н 302 2:37 ps 2n 
! 673 3-54 343 2.52 
2 54 5:03 4 2-91 
Я ae 6-86 442 4-54 
$ 21.47 905 hi 
5 21: 

26 
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6-8. These values of A might be compared with those obtained by extrapolation: 


p 
m T ——— - 
2 3 4 | 
0 1-52 1:39 1:32 
1 2-56 2:15 1:85 
2 4-15 3:29 2:88 
3 6-37 4-88 4-03 | 
4 9-28 6-93 5-38 
5 12-95 9-56 8-71 
4 | 


In this instance, direct extrapolation is not аз efficient as previously, owing to the rapid | 
increase in the variance of T, with p. This is of order (p+ 191, 4 

Better estimates may be obtained using the device suggested іп $2-14 of using ап 
alternative scale as a basis for extrapolation. In this instance, something like a quadrati 
scale is suggested, and the use of the scale (+ 3)? appears to be fairly efficient. 


6-9. It is fairly apparent that anything like a full analysis to estimate the higher serit 
correlation coefficients would involve a great deal of work which could not in general 4 
justified. The prime need is for some fairly rapid method of gauging the likely serial ооб 
tions in any series using their variate differences. The above analysis gives an indication 
how this may be approached. 


In general, it is possible to define a series of functions 


T 2m — 1)(; 

By = e (2m— ! A Е УВ, 1 
such that if the serial correlations higher than the mth are zero, the expectation of Вир ® 
Ру 0. These may be used for general investigational purposes. re 

The variances of the Rmn increase vary rapidly with p, во that the higher values ж 
unreliable and, for extrapolation purposes, cubic, quartic and higher-order scales ® 
required. 

7. EXAMPLES 

7-1. As a first illustration of these functions, series 1 a 

7:16 give the estimates, Ry, Of serial correlation for the, 


Table 7-1a. Estimates о 


„aand 
nd 2 may be used. Tables 7 peo 
Se two series. Examination 0 


of serial covariance for series 1 
0 156 2nd 3rd ath pth 
cum — 
2 209 668-8 ir 12825289.9 17776284-4 | 29551645-6 971027724 
3 1180-4 Pi. 127936-0 9361156 | 3766500 5454020 
4 1207-8 = о 1445-5 1755-6 HE 69217 
5 1253-6 = 887-0 714-0 - 680-4 4785-0 Е 123:3 
6 1293-6 = Fo a ~1016-4 gd Z 5888" 
7 1329-2 — 1159-2 1554 — 1429-1 "rari = 
8 1361-4 — 1397.5 e —9559.4 i 
9 1390-9 — 1512-5 0 ===> = = 
10 1418-4 = кз = ШЕ 
ii 
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Table 7-10. Estimates of serial covariance for series 2 


399 


Т | 
2nd 


3rd 


4th 


5th 


— 386-65 


Ist 
= — 
1 6463296-96 
2 106455-00 
1172:70 
4 121-20 
5 4242 
6 — 53-48 
а — 161-28 
8 | 274-50 
9 
10 


| 10594736-62 


1037-85 
1233-65 


| 14464143-22 


484447-60 

7950-26 
— 623-28 
— 85470 
— 1009-14 
— 1018-16 


17973894-37 
765799-30 
16840-88 
545-49 
430-23 
29-32 


|||| 


| 21032116-20 


tables is sufficient to show that, in both instances, trend apparently exists in the third 
Variate-difference and that the first three serial correlation coefficients are likely to be 
negligible. At first sight, the largest would appear to be the first serial correlation coefficient 
Of series 2, but an extrapolation using 755-79 gives an estimate of 117-32 for the serial 
Covariance, This is well within the bounds of chance variation. 


7-2. This analysis was carried out for four other theoretical series. These were: 
Series 3. Eighty terms of a Markoff process with p = 0-5. In fact, the first eighty terms 


9f Kendall's (1949) series 11. 


Beries 4. Eighty terms of a mo 
mined by u; = 2¢,-+€;-1, where 6; 
Series 5. Eighty terms of the autoregressi it 
the first eighty terms of Kendall's (1945) series 3 multiplied by 


0-05. 


Series 6. Series 5 plus a trend component determined from 


The estimates of serial correlation for these series 


i "em 
Т, = exp — + sin di? + 


(4-30 


Ін 


1000 ” 


ving average process for which p, = 0-4. This was deter- 
was the random element from Kendall’s series 11. 
ve process u; = 114; 1- 0-8и; +E; In fact, 


are given in Tables 7-2а—7-24. 


Table 7-2а. Estimates of serial covariance for series 3 (0? = 1:33) 
З C - 
[ 4th 5th 
ЕС 2nd 3rd 
PN 0 lst = 
"и е 0-035799 | 0-149848 
-267895 0-055272 К 14: 
1 | 0-812338 | 0404673 | 55 | 0021483 | —0-086874 | 0.075819 
2 | 067747 | 0243036 28415 | 0081538 | -0138569 | 0.019698 
3 | 0.636701 | 0152570 | 9219, | 0.152082 | —0-149928 | 0-109313 
4 | 061634 | 0091515 | ОУ | 0-215688 | 0208984 | 0390941 
5 | 0615433 | 0053004 | 0.070200 | 0288913 | —0-928416 = 
: 0315 | ae E. 
7 | овцама 0.022140 0004588 | 0 389964 = 
8 | 0.611169 | 0021645 |— 007045: = 
in 0-610688 | 0-028050 = adi 
0-610178 — 
Mel cem | 
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Table 7-26. Estimates of serial covariance for series 4 (0° = 5-0) 
m 0 | Ist 2nd 3rd 4th 5th 
Bs | | = 
p | | ы Е Ds 
nf 3-561117 2-503398 0-273930 |— 0-306278 = 0-239615 appe 
2 2-720051 2-339040 | 0-492700 |—0-119911 — 0-742828 par 
3 | 2836811 | 2-141960 | 0-556938 | 0-342293 | —0-966501 и 
4 | 2122615 | 1-982835 | 0-414316 an | e 
5 1-990426 1-894053 | 0-136206 1-293402 — 1-305607 1.2158: 
6 1-900233 1.871352 |-0-216540 1-762103 — 1-826737 = 
7 1-833399 1-900224 |— 0-617018 2-324176 -- = 
8 1-780615 1-967535 |— 1-063975 — - == 
9 1-736892 2-064260 = = - = 
10 1-699360 - — | = — = 
Table 7-2c. Estimates of serial covariance for series 5 (0? = 0-0905) 
Tm" 
m 
б 0 Ist 2nd 3rd 4th 5th 
nt | 
Е r 7 | 9510146 
1 0:02762339 0:04650171 — 0:02157966 0 025 3533 
| 2 | 001212332 | 0-02727420 | 0.09937975 —0-00104210 |—0 0100205; 
3 0-00757762 0:01552230 0-01838865 g 0-00989108 |—0 0041656 
4 0-00602539 0-01026840 0-00956984 0:01612968 0-01229467 0-006: 3670 
5 0-00534083 0-00821772 0:00419328 0-01091376 0-00916798 00098 
6 | 000494951 0-00751884 | 0-00121630 0-00762269 0-00495225 сый 
Т 0:00468098 | 0-00735660 |— 0-00051563 0-00609892 — = 
8 | 000447663 | 0-00741285 |. 0.00168850 5e — - 
9 0-00431190 0:00756635 ES ~ = = 
10 0-00417433 — — em us ail 
Table 7-24. Estimates of serial covariance for series 6 (с? = 0-0905) 
т 
N 0 h 
p^ ue 2nd 3rd 4th " 
— - p o 
— = Г 8 
1 | 0-02943038 | 0-0519597 балт 0140 
| 2 | 001211046 one 004107950 | 0-01625029 |--0-00570086 |—0 un 15 
3 | 000755845 | 001559390 | 002947215 | 0.02068429 |2 000018025 |- MAU А 
4 0-00600613 0-01026855 d 1839128 0-02116978 0:00986637 = 000656084 
5 000532156 | 0.008215 (00957054 | 0-01614690 | 0-01226992 0 0087439: 
| 8 000493024 | o.00751 900 000418824 | 0-01094148 0-00902555 ав 
7 000466168 000735919 |_ on 0-00770154 0-00527813 == 
8 | 000445726 | 0-00141870 |20. ош 000607750 = = 
9 | 000429240 | 0-00757419 ni = — = 
10 0-00415468 = Би = = — 
е 
Examination of these +; ten? 
ables shows that x18 
i ex 
of serial correlation. There is very little а "= case could we be in doubt about the a fro” - 
1 2 
these tables alone, there might easily be erence between Tables 7-2c and 7:24, ae co 
ponent in series 6, Some doubt about the existence of a tre 
02? 
Table 7-2с indicates wh ut 
at is likely + 9 gh 
regressive вс У to be а common characteristic of oscillator uh? 


M. H. QUENOUILLE 401 
the corresponding estimates of variance. This is due to the fact that the latter will tend 
to be greatly reduced by the serial correlation, while the former will not be reduced to the 


Same extent and may even be increased. 
7-3. As practical illustrations of this approach, it is convenient to consider Kendall’s 


(1943) ungraduated sheep series for 1 867-1939 and Davis’s (1941) figures of monthly freight- 


car loadings for the United States. 
D i . H +. 
Estimates of serial covariances for the former series are given ш Table 7-3a. 


Table 7-3a. Estimates of serial covariance for Kendall's sheep series 


р MC 0 | Ist 2nd 3rd 4th 5th 

1 | 3177-8585 5180-0270 2607-2578 875-0051 1306-9226 
2 | 1155-9531 3317-7000 | 1926-6983 - 194-2952 — 946-0034 
3 | 663-0031 22855402 | 2047-5931 | 450-7071 | — 424-8454 
4 977-6085 | 1432-3764 1818-1422 695-8102 — 1471-9133 
5 670-7607 826-3290 1522-9500 | 1423-6794 — 2034-3657 
6 | E 410-0790 | 1011-8856 | 2295-5504 in 

7 383-7906 E 181-0050 305-5624 -— md 

8 | 370-5146 458-4000 122-2430 — = = 

9 360-3278 447-2030 = — = = 

10 | 352-1952 иш = == ры = 


There would seem to be little doubt about the existence of either a trend component or 


Serj керү" 3 . 
“nial correlation in this series. | | 
Trend apparently шау be regarded as being confined to the first two variate differences, 


zn Which ease an estimate of à? may be der ived from the third and higher variate differences. 


This estimate is more than 3700. | . 
] characteristies of these estimates and 


There is a marked similarity between the genera. Во 
6. The large values taken by R,, and Rop indicate that a second-order 


tho, 
8e of series 5 and 
o represent this series. 


toregressive scheme is likely to be needed t 
7-4. The analysis of Davis’s freight-car loadings requires special adjustment because of 


i existence of seasonal variation. This could be easily carried out by an extension of the 
Analysis of variance technique. It required the extraction of seasonal components from the 
rmulae of § 2-3, using m = И instead of 10 as usual. Starting with 167 observations, this 


Save us a value for У,167-11-12- 144. 


ates for serial covariance for freight-car loadings 


Table 7-4а. Estim 
^ x x ist ond 3rd 4th 5th 
Be Sy 
265-92 320-355 483:472 
1 35-608 415-660 265 987 зе : 

5 900-290 ды 218 225-705 16-772 xd 75:213 а 
3 621.754 495.930 216-720 | 63-571 * 17 8-763 
4 524-052 434-010 190-232 123-396 - 197182 - 65- 781 
5 474-450 393-246 149-100 190-080 - E — 108-848 

5 206-790 368-396 97-200 | 29685 E = 

5 413.042 355-428 i ew — = 

5 403-769 350-640 B = == - 

10 395-977 351-175 ыз = — — 

389-592 == 
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The estimates of serial covariance for this series, when seasonal variation has been 
eliminated, are given in Table 7-4a. | 

There is clearly a high trend component in И, but apparently not much beyond this. 
There is also clearly a bigh first serial correlation coefficient, а low second serial correlation 
coefficient and not much beyond this. These values suggest the fitting of either a Markoff 


process or a moving average scheme of length two. There is a fair degree of resemblance 
between this table and Table 7-2a. 


8. CORRELATION ANALYSIS FOR NON-RANDOM SERIES 


81. Before discussing correlation analysis for non-random series, it might be noted that 


the serial covariances of the variate differences may be directly calculated using the у. 1 


Пра is used to denote the gth serial covariance for the pth variate difference, then 
В = У-У?) (V?z, а) 
N 
р 


and the following identities apply: 


2p+1 
Dj = Вт а» 


B= yee қ | 2(22+1) (2р+3) " 


? pl "5 7 (pel)(pr3) we 


Deg = го а 1222 * 1) (2p +3) T 
OFI (р--1)(р--2) рта 


~ 42 +1) (2p +38) (8р+5)р | 
(»-1)p-3)(p-3) ”* 


These may be used to calculate the serial correlations of the variate differences UP Е 
any order. 

8-2. A direct application of these formulae occurs when we consider the effect on the А of 
any linear transformation of the т. For instance, if the transformation 


Yi = (pr Ми + pe 
is carried out, the revised У, are given by 
пређу 


nee | 
Similarly, a transformation of the type 


у; = (т; — ал | + bz; „Ма +a? +b?) 
gives values =й 2a(1 +b) 2b 
1+а фа “а + Гагра Ва 


8-3. Correlation analysis may be carri i 
ried out for ghi 
t forrand i non-random series in a similar f? 
to tha random series. However, unless at least one of the већ; —— ығ ійс2 
of the resulting coefficients ig liable to be incorrect} /€8eriesisrandom, the sig? 
As an alternative approach : сау estimated, 


on 
ce 


random elements. This is the approach which vil егіев so that they consist of uncorr? а 


© considered. 
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The maj іп 
| ajor alternativ: i i 
а ak таты е to this approach is to adjust the degrees of freed ul 
This might be а ctor which would take into account the егіні со : indie 2... 
19883. one using Bartlett's (1935) adjustment. (See min а стани 
| 5 2 example, © і 
a) ple, Quenouille 
Ь at the best method 
inka h ethod may be for determining a suitabl ri 
values w у МИР = maia i 
lues when trend exists is not clear. However, one ете ел уз miri 
1 H А oach which а 
is to employ the transformation so that pim рат E 
j 5 of serial 


be r 
reason: a 
asonably satisfactory 
b des А 
е done using either the individual values ог th 
` the 


COvariz x J; 
tod, of $6 disappear. This may 
ates. The method will be clarified by examples. 


9. EXAMPLES 


9-1. Iti T 
1798 нчен = calculate the effect on Table 7-2a of adding D, to the esti 
four values of 5 n 08 able 9-1a. The estimate of the first serial covariance fr mates; 
from Table | able 7-2a is 0:673841, while the corresponding estimate of tl om the first 
lim Tubos la is 0-831860. Thus, 0-673841/0-831860 = 0-81 times Table 9-1 и 
› 7-2a gives a zero estimate for the first serial correlation атый жаш 


Table 9-1@ 


\ 
N m 
р \ 0 Ist 2nd 3rd 4th EN 
1 
н i 
_0-203832 | 0-540117 | 0229965 | 015 75 % 
5 20: 5 51758 5 e 
^ — 0-383872 0.402138 | 0-121070 | 0-074023 Pc 01062521 
3 | —0450804 0.353510 | 0082915 | 0-039598 0-056002 |— ТОО 
с 245 | 0-329820 | 0-066416 0.010752 | 0-106354 ыш 
6 5 0:315588 0-062832 |-0-034368 ==. == 
— 0-523261 0-305116 0-075720 — == <= 
— 0-534158 0-295020 — — = = 
— 0-542353 — — — = БЕ 


= (в; реа) +p), where 2p[(1 +") = 0-81, 


a transformation y; 
all of the serial correlations may 


This 
18 corresponds to 
ade in the above instance, 


US coy. 
и 0-51, and, if this is m 
usly be neglected. 

9-9 
- The existence of trend in 


ter 

ms š 

Using ¢ For instance, if trend exi 
i he second to fifth terms. The estimat 


1. 
5; p= 0-52, 


ould have necessitated the use of higher 
ate factor might have been estimated 
ave been 0:55982/0-685038 = 0-817 


the above series W 
sted in И, the appropri 
e would h 


s necessary to caleulate the perturbations 
may be used to set up simultaneous 
higher order serial covariances equal 


9.3, 
due " For more complicated schemes, it i 
quai Dj Л,» and to consider these jointly. These 
to > ms to find coefficients п ake the 
Zero 
For i 
| а for series 6, the 
B. more than the values in t 
p estimates, it is appare” 
nting the series and that at leas 


ecessary to m. 
ations due to Рр and Dy» are as in Table 9-34 

а to be subtracted from Table 7-2d to 
ћете would not suffice as a means of 
e scheme must be used. 


perturb 
able 9:34 would nee 
tthata Markoff sc 


хер 
$ а second autogressiv 
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Table 9-3a 
Perturbation due to Р), 
m 0 Ist 2nd 3rd 4th | 5th 
p ы |- — 
ZIISTO -00283112 
1 0-01126469 | 0-03525495 | 0-03410505 | 001768935 | 0-00511876 0 MUT 
2 | —0-00048696 0-01479192 0-02146980 0-01370809 0-00280239  |— 0-0 м 
3 | —0-00295228 0-00620400 0-01412618 0-01196438 әні 
4 | —0-00357268 | 0-00216795 | 0-00914102 | 0.00773 = 
5 | —0-00371721 0-00020916 | 0-006545 0-00362 — = 
6 | —0-00372717 |—0-00088172 | 0-00555 — — E 
7 | —0-00369568 |—0-00162252 = = = 
8 | —0-00365061 = = = = 
i L - 
Perturbation due to D, 
s^ 0 Ist 2nd 3rd 4th 5th 
қ Г i= кы жан си s Б 
1 |—0-00544013 | — 0-00659004. 0-01186480 0-02297309 0-02621533 003378085 
2 |--0-00324345 |—0-01370892 | 0-00454455 0-00258399 0-00675336 0-011 90321 
3 |—0:00095863 !—0-01189110 |—0-00592883 | — 000161810 |—0-00140879 | 00022932 
4 | 000023048 |—0-01019715 | —0.00525462 —0-00090090 | — 0-00273183 = 
5 000091029 |—0-00907116 |—0-00495432 0-00035806 — ud 
6 000134225 |—0-00824544 | — 0-00502470 — = Беа 
т 0-00163673 | — 0-00757548 == == me == 
8 0-00184716 — = — кегі = 
=: ЗИ 


To estimate the appropriate proportions of Tables 9:3a and 9-36 to be added to Table 


2 е 
7:24, extrapolates based upon the third to sixth variate differences might be used. Tes 
give equations 


0-06902037 + 0:04286921 А — 0:02305761B = 0, 


0:07588594 + 0:05124778 А — 0-01022722]5 = 0, 


or А = – 1-40, 


В = 0:38, 


These values might be сот 


pared with the true values А = — 1.39 B 
obtained using extrapolates А 


е 
= 0-56 and with 0108 
based upon the Second to sixth variate di 


.39, 
fferences, A = —! 


that both 1+4+B and 8B(1— p)— : 
exists for which А = — 1:40, В = 0-38, but if A у 8, 5 , 
are satisfied and а = 1-6, b = 1-0 ~ 


4 — ааа 
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correlation is, however, too small (p, = 0-06, p = — 0:29, р; = — 0-27, py = —0-07) to be of 


any importance. 
The variate difference analysis may then be derived by the appropriate multiples of the 


above table from Table 7:24. This gives the following results: 


Мэ» | 
РА 0 Ist 2nd 3rd 4th 5th 


| — 0-00144706 0-00159336 | — 0-00133238 0-00048857 


0-01126616 | — 0-00029675 
212 0-00074591 


0:01136509 0-000571 — 0:0025$517 0-00262992 |— 0:0017 
001126954 0-00160552 — 0-00399406 0:00370768 — 0-00224069 000072614 
0:01110929 0-00274667 — 0-00553892 0-00484840 | — 0-00265962 — 

— 0-00715505 0-00597672 = == 


0-01092618 | 0-00393359 15505 
0-01073887 | 0-00512609 | —0-00878507 = - - 
55579 | 000029744 — 


[EN 


0:01038086 — 


This table indicates that very little serial correlation now exists in the series. The 
differences of the revised values of И; may then be used to gain a better idea of the trend in 
the new series. А : T 

9-5. A similar analysis was carried out using Kendall's sheep series. This indicated clearly 
that à, Markoff process could not be used to represent the series. The equations for deter- 
mining 4 and B, derived using the third to sixth variate differences were 


7437-9107 + 4292-29674 — 2408-4016 = 0, 


200 5904-11214 + 2639-7281В = 0, 


6891-9 
from which А = — 1:42, В = 0-56. ; Е : 
The corresponding estimates using the second to sixth variate differences were A = — 1-35, 
В = 0-78 
К ter than those for the artificial series, though 


Both of these estimates are markedly grea | с 
the former esti to is very near the expected values for the artificial series. This serves to 
mer estimate is Very 1+ 0-8 will give rise to nearly random 


Show ifferenci rocess Up — l lUn- 
thet ie PEE t а sis. The following table gives the 


LM i ay then be used in à correlation analy 
2... je il 0:5, Le. ifa = 1'21, b = 0-76. It may be seen that with these 
1 =—1-4,Б= ,1.6- 


Values there is very little evidence of serial correlation and trend does not seem to be of any 


importance after the first variate difference: 


a 0 Ist 2nd 3rd 4th 5th 
= à — = 
23 8888 95-6965 и, 
2 | 910-7538 | -09256 | — 102-8280 310.9275 | — 727-9566 | —4415-6187 
У оа 103050 | — 232-3812 249-9381 1819-5162 - 
5 m 90-1909 | — 315-6038 -90:3925 - - 
6 | 902-8087 142-7166 —291 = = - 
7 | вөттііт | 1815221 = = = - 
8 | 892-6694 = 
~ 
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қ n 
This latter statement can, of course, be tested using тарыдан x ы” 1 
is instance there appear to Бе components of the order of 25:5 anc á к а E Жа 
EC 83 final analysis was carried out using Davis’s freight-car loadings. Here endi и 
Қ ен could not be accounted for by a Markoff process, ene E qn т a 
greater than one of D,, would have been needed to account for the serial correlation. 4 


à : ң 5 gave А = — 1:03, В = 0:38, 
order process fitted using the third to sixth variate differences gave А = —1 ; S y latter 
while one using the second to sixth variate differences gave А = — 1-02, В = 0:39. | ak for 
values correspond to а = — 0-55, b = 0:27, which is not unlike a moving average proces 


which p, = 0-44. The following table gives the values of V, for this process. 


0 Ist 2nd 3rd 4th 5th 
1 851-388 40.775 91.896 147-772 116-311 y 675 
2 837-796 - 14-363 - 13-655 57-308 73-878 80.180 
3 840-190 — 8:900 — 44-356 03:27: а 
4 841-080 3:773 — 87-388 58 2 
5 840-828 22-499 — 140:352 224- = Ex 
6 839-757 45-891 — 201-680 == = Es 
Т 838-118 72-781 -- -- -- = 
8 836-096 -- == — "EN 
a ie WES 


There is apparently a low serial correlation in the г 


ent 
evised series with a trend compo! 
of about 18 in the first variate difference, 


10. Discussion 

Since it is theoretically possible to 
statistics using the quantities V, re 
elimination should be capable of г 
difference method. For example, the 
is equivalent to using a smoothed se 
this method should be obtainable us 
Of course, as has been shown ( 
methods of eliminating trend are 
of the series, and, therefore, anya 
degree would seem to have а, el 
The analysis described aboy 


t 
reproduce any symmetrical derived second-momel 
sults obtainable using any method of general a 
eproduction using some modification of the Va alae 
use of symmetrical moving average difference {ор 
cond difference and, therefore, any results кеме : 
ing some modification to the variate difference me па 
Spencer-Smith, 1947; Quenouille, 1949), the comet тв 
liable to lead to biased conclusions concerning the ue er 
Iternative method which was unbiased or biased to 21688 


у calculations, but these would not see™ к 
nalyses now employed and would seem more ame? 
carried out. 
At this stage, many more 
otherwise of this method is 
in many ways, discredite, 
overcome many of the cri 


Practical proble 
established, Т 
а Variate-diffe 
ticisms to whic 


т 
PT 
to be tackled before the utili 2 ad» 
› however, to note that the ol ation’ 
can by some slight modific 


ms will hay, 
tisin teresting 
Тепсе method 
h it is subject, 


| 
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APPENDIX 


It is 
necessary i 
ssary to consider the effect of extrapolation on 


. 2i РЕЖ 
Уље Val 1-5 200—2) 
|| зац) "° =] 


wis = 
oe its is known for i = т + l,...,m- p- 
is not hard to show that if 


ља 
№) === for = т +1, ...,т+р, 


“ati а 
then t A 
he value of f(0) obtained by extrapolation is 


" (m4 1) (+2)... (m p) 
a(a- m4 1) (а+т-?)... (а+т+р) 


This fol j 
lows, since 74, = J! 
ут (а+т+ 1) (a m 4 2) ... (a+m+j+1) 


and 
о) = fom 1) + (n D vm CPP 
; be found for any function which may be 


As 3 š 
a corollary of this result, the extrapolate to zero may 
е extrapolate to zero for the function 


split j | 
plit into partial fractions. Thus th 
20 О... 9+0 | 
T а RT PP f = 
N= пена OD | nee 


equals 


жу ши 


(m t p)! ( »( 
1 k—1] (mopck)miki(g—1)! ` 


(m+p+h)! ok 


i Ma 


5, (—1)*-1(ф+Ё—1)! оп 0)! 
к=з (а-ілін(е-1)! т 


In thi 
his latter form it may be seen to be the (4 — 1)th difference of the function (3-455) E (mt N 
(m+ptk)!mtki(q—1)! ` 
Ifg> В а А i j 
Өш ү + p this function is а polynomial of order g — P and, consequently (as might be expected), the 
If polate is zero. 
9<т-+р, it may be noted that 


(2) (3) wyi 
(mak) __ T 8 + к. | 
ЕПА РЕНЕ ЕШ жары (р+Ё+?! 7 
(а+р—1! _ 4! i 
—__*—_.. Using these two facts, 


and that (4+#-1)! ; 
wèt- iffor ea scd 
he (д — 1)th difference of Ека is(—1) (a4-p— 9! (ара! 


ernative form 


(" м (2) р 
1)? о] PUP ) 


-p A Пә+1#—@1Шюр+®)—@1 


thi 
© extrapolate may be put into the alt 


pe HERES |3 
mXp-—2) (Q2 


ied by noting th 
all, and that eve 
+. For other val 
on for m = 0. 


ression is of order 1 (р, when m is zero and p is 
der of p (= Xp, say) the expression is 
ay be obtained by considering 


at this exp) 
n when is of the or 


т 
he proof may be comple 
ues of m, & similar result m: 


ar : 
SE th. q being regarded as 5n? 
E order (1 + X)-? at mos 
odification to the e: pressi 
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MOMENTS OF THE RANK CORRELATION COEFFICIENT 7 
IN THE GENERAL CASE* 
| 


By В. М. SUNDRUM 
Division of Research Techn iques, London School of Economics 


1. INTRODUCTION 


and variance of 7 are known in the general case and for a number of special cases 


The mean 
ven below in terms of p, the probability of concordance. Two 


Some of these results are gi 
Observations (жу, у) and (жу, yj) are said to be concordant if (v; — 2;) (y; — уу) > 0; then it can 


be seen easily that p is related to T by the formula 


т= 2р-1=р—% 


where q = 1—р. 
(a) General case 


ate of p (and similarly use a prime to distinguish sample 


If we write p’ for the sample estim 
arental values), we have the result of Hoeffding (1947) 


Statistics from their corresponding р 


for the general case: Е(р) =P di 
2 
ор) = pp T Ptn e-a m 


Where 4 is the probability that out of three members drawn from the population, one is 


concordant with the other two. 
(b) Special case: independence 


) has obtained 


Е(р) = 3, 


For this case, Kendall (1938 
(3) 
o(p’) = 18n(n—1) Т! 


case: normal correlation 


(c) Special 
due to Greiner and Esscher respectively, are given 


| b For this case, the mean and variance, 
Y Kendall (1948): 
Е(р) = lsin, (5) 
1 ага На 
у ale P) *2(n-2)]g- (5 S 320) | |- (6) 
n(n— 


Further, Hoeffding (1947) and Daniels & Kendall (1947) have proved that the distribution 
oftis asymptotically normal in the general case. Therefore, in the case ot large samples, the 
above results are sufficient to fix the distribution completely. But it is notorious that the 
distribution of correlation coefficients in the non-null case tends to be extremely skew, 


Particularly because of the limited range from — 1 to +1; and that a very large sample is 
» Casta шафрана ва БУО оов of Phib. лата Про ә, London: 
ora е: 
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required before the skewness of the distribution is small enough to be neglected. The ai 
of medium-size samples, which are likely to be most important in practice, a 
requires some consideration of the departure from normality of this sampling distri = же 
The third and fourth moments of this distribution are known. only for the null eed 
independence. The cumulant-generating function of the distribution in this case was deriv 
by Moran (1950) and Silverstone (1950), and from their result we obtain 


, 7 

H(p’) = 0, (7) 

100n4 + 32873 — 19712 — 997% — 372 (8) 
2700n3(n — 1)3 | 


pp) = 


In this paper, formulae for the third and fourth moments are derived for the general case, 


and for a special case of normal correlation. But first we prove a conjecture of Daniels & 
Kendall (1947) about the upper bound of the variance. 


2. UPPER BOUND OF THE VARIANCE oF 1 
Daniels & Kendall (1947) considered a 
which the sampling variance of t was m 
parent ranking was of the ‘canonical’ 
natural order, a certain number of the 
the correct natural order, and the other 
below, in terms of p. 

Let the parent population consist of У bivariate observations, such that the probability 
of concordance is p; this means that out of the N(N-1 
there are Р = pN(N — 1) concordant 
cordant with the member whose X. 


population of ranked bivariate observations in 
aximized. They surmised that this occurred when the 
form, i.e. an arrangement where the X's are in th А 
Y’s at one end or the other of the sequence wale М 
8 in the inverse order, А proof of this result is give? 


) pairs (each pair counted twice) 
pairs. Write P, for the number of observations vr 
-rank is i; then P = ХР, The problem is then to find 
i 

subdivision of a given P into a set of P's such that the sampling variance of p' is a maximum 
From (2) above, we see that in order to maximize the variance of p’, for a given p, we have 
to maximize k. . 

Take two quantities P; and Р, such that Р.> Е. The contribution to | from these tw? ke 
clearly P(P; — 1) and P(E; = 1). Now consider the effect of redueing Р, by one and increasing 
Р, by one. The contribution to Ё is then 7 


P(P;+1)+ (Pj 1) (D. — 2), 


ore the effect of such an alter 
ximized by increasing ав m". 

her Р, as much as possible. 1) 
gle observation is clearly us 


e of k. Therefore, kis ma: 


the followin 
X-ranks 1 2 3 N $ 
Y-ranks Ех ` Мм 
Concordances P P, р d 
for any Р, = (N — 1), the i y 


e 
to 


less than Y;, and all F’s t 


1 


= 
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11 


the left LE 
; of (7, Y; st ; 
E (i. Y;) must be concordant with the observations to the ri 
ian : ances are not reduced to their least possible value. Т. 2 Sop Sage пате 
cordances п 5 тшда. Таша сі 
aust occur at опе or the other end of the sequence. ТІ ~ мат и 
i ха . Therefore the ranki 
assigned thei for a given Р, as many А ssi veri 
n ~ ed their maximum value (№ — 1) and the remainde nio embeds e 
Е te > (2 ће а т are assigned their mini 
ка à : = B Ше minir 
fxs dos! e number of observations with (JN — 1) concordances. It can be iy ne 
s leads to a с: Я % > ў : cde | 
ids to a canonical ranking. This can be done in all cases, with tl "d 
А he possible 


excepti 
ption of one observation. 


whicl dmi i 
1 maximizes % is one in which, 


3. THIRD AND FOURTH MOMENTS IN THE GENERAL CASE 


1300) = Elp’ P- 3E) Е (р) + ЗЕ(р')% 
three pair anner of Hoeffding (1947), (p')* is the probabili 

this е drawn from the sample, with replacement, being concordant. We ma и кесі з 

dia о ability in terms of the following probabilities involving members of "4 ая 

without replacement. Then (р') is the sum of the following eight АКЫП! ў 

zing the same pair three times, (3) ig multiplied by the 


To find 


We first, 4 
sb ler ing i 
consider ( p')*. Arguing in the m 


(i) The probability of draw 


Probabili А 
bability that it is concordant, p: H Е 
( p- 
2 


are identical, but the third has one member in 


(ii) The probability that two pairs 
the probability, termed 4’ by Hoeffding 


multiplied by 


com ; n 
mon with them, 3 e ‚ 9(%— 9), 
le, one is concordant with the other two: 


that 
among three members drawn from 


6(;) => (n— 2) Ms 


are identical, 


а samp: 


t two pairs while the third is distinct 
bility of drawing two concord 


ability for the population is (p°): 


(iii) The probabili | 

" ( "d. gol probability tha 
2 5 ) ‚ multiplied by the proba. 
2)’, since the corresponding prob: 


КЕЛ 


as one member in comm 
Р Сы 
the third, 3(3) 2(n—2)(n—8), multiplied by the 


two concord 
th one of the remaining two: 


ant pairs, without 


re 
Placement, i.e. (p 


(iv) The probability that a pair h on with the second pair, and 


the 

other member in common with 
ple of four, 
dant wi 


a(t) e-ne-2* 


ant members can be chosen in such 


pr T 
vesbility 1, that in а sam 
У that each of them is concor 


2 
wo pairs have one member in common, while the third is 


—8 ae i 
ТЕ 2 ) multiplied by the probability, (kp), i.e. the 


(v) The probability that * 


Com: -? 
Pletely distinct, 3 (2) 2(n—? 
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| Бо та гё. 

bability of getting a set of three members with the property mentioned in (ii) above; 
oba pur us 
ча. by the probability of a distinct concordant pair: 


Э-э» 


F ith 
(vi) The probability of drawing three pairs, all with one member in common, and wi 
br E А = а , TT robability 
the other members of each pair distinct, з(2) (n — 2) (n — 3), multiplied by the probabi 


і is rdant 
t' that among four members drawn from the sample without replacement, one is concorc 
with the other three: 


«() ^ (n—2)(n—3)t'. 
er 

(vii) The probability of drawing three pairs, any two of which have only one м 

in common, i.e. if there are three members, А, B, С, the three pairs are AB, AC, Во 


ош the 
22) (п— 2), multiplied by the probability that among three members drawn from th 


sample without replacement, each is concordant with the other two, и": 


x) + (n — 2) u'. 


7? (m —€£ = „рей 
(уш) Тһе probability of drawing three distinct pairs, m (" У) р " " F multiplie 


2 
by the probability that they are all concordant, ( 3)”: 


e Cuv. 


In a similar manner, (р')8, the probability of 


a 
drawing four concordant pairs from 
sample, replacing each pair, is the sum of the follo 


wing twenty-three probabilities. 


А = у һе 
pair four times, (2) И multiplied bY Џ 


7-73, 
(;) ». 


m. xn ы in 
(ii The probability of drawing three identical pairs, and the fourth with 000 1 
зү за ers 
4 *(m—2), multiplied by the probability, Ду, that among three memb 
drawn from the sample, one is concordant with the other tw 


(i) The probability of drawing the same 
probability that it is concordant, р’: 


common, 4( 


о: 


п\-3 [n — 9 "ee 
4( | ә |: multiplied by the probability tha. 
are concordant, ( py: 


(97 (n—2) (n — 3) (poy. 
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(iv) The probability of drawing two sets of identical pairs, with one in common 


n\ -3 
x( 2(n — 2), multiplied by the probability, k’: 


6(;) 7 а 2). 


(у) The probability of drawing two identical pairs, and the third and fourth with 


the в; е A м Н 
е same member in common with the first two, and other members distinct, 


NS 
44) 2(n — 2) (n — 3), multiplied by the probability. t', that among four members drawn 


wi Р 
thout replacement from a sample, one 1 


12(3) 7 (-2)(-3)t. 


2 


s concordant with the other three: 
> 


(vi) The probability of drawing two identical pairs, the third with one member in 
common with the first two, and the other member in common with the fourth pair, 
в(”\ 73 | 

B 4(n — 2) (n — 3), multiplied by the p 
an be chosen in such a way that each of them is 


robability, l’, that among four members drawn 


fr 
Е От а sample, two concordant members с 
Oncor 2 з 2 
ncordant with one of the remaining two: 


+ EN 4 i төлі pairs, with one member in со it 
(vii) The probability of drawing two identical pa mmon with 


В n\ “3 
. Е dna DE OAT; 
er member in common № ith the fourth pair, 6( “| 2(n — 2) (n — 3), 


t › 

he та pair, and the oth 
ТЕТЕ тя 
Nultiplied by the probability, Г (as m (vi)): 


ЕН 7 (n—2)(n—3)U 


TD T A :dentical pairs, with one member in common 
; (viii) "The probability of drawing two Бей п он (2-3 Е 

“th thira pair, and a fourth pair distinct, 12(3) 2(n-2)| 2 J.mu tiplied by the 

Probability, (Iep)’, as in (i) and (ii): 


12 () ^ т 2) (n— 3) (®—4) (ір). 
"M 
identical pairs, with the same member in common 


-3 
n 
rs are common, 6(;) 2(n — 2), multi- 


awn from the sample without 


(ix) The probability of drawing two 
Wi қ 
Ith the third and fourth pairs. whose other membe: 
nu by the probability, "^ that among three members dr 
Placement, each is concordant with the other two: 
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ea) t) 
(x) The probability of drawing each of two distinct pairs twice, x J ( о |’ 


2 


: 1- 

multiplied by the probability, (p?)', that two pairs drawn without replacement are co! 

cordant: 3 -3 ics 
s) «-2e-»2u». 


; à in 
(xi) The probability of drawing two identical pairs, and two other pairs with one 
common, (47 > ” 2(%— 4), multiplied by the probability, (Ер), as in (viii): 
n\-8 А 
«(2 (n— 2) (n — 3) (n— 4) (Ер). 
(xii) The probability of drawing two identical pairs, and two distinct pairs, 
"1-3 (n —2 [n —4 
973759 
t: 
multiplied by the probability, ( 3)’, of getting three concordant pairs without replacemen 
3 (пу => М үде. 
519 (n— 2) (n —3) (n— 4) (n— 5) (phy. 


ers 
(xiii) The probability of drawing three pairs with one member in common and oth 
distinct, and the fourth pair having the uncommon members of the others, 


407 4(n— 2) (n—3).3, 


multiplied by the probability, 
replacement, one is concorda; 
each other: 


“cp aut 
a’, that among four members drawn from the sample Шығ” 
nt with the other three, two of which are concordant 


мі) = 2) (n—3)a’. 


te H . ber 
The probability of drawing two distinct pairs, the third pair having one “> 
each from the first two pairs; and the fourth pair having t 


multiplied by the probability, b^, that amon г 
replacement, each is concordant with two o 


(xiv) 


Q3 Th ) D 
he other members, 3 А 42 6 

а а hou 
g four members drawn from a sample wit 
ther members: 


6(3) (n— 2) из). 


ss Ж; 10% 
(v) Ths probability of drawing three pairs with one in common, the fourth pair a 
one member in common with the uncommon members of the first ‘hina pairs, and әзі; b 
‚ё 
» multiplied by the probability, > 
among five members drawn without replacement, А " other?: E: 
of which is concordant with the fifth: ы ‚ °пе1в concordant with three 


== 
member distinct, «(5 2(n — 2) (n — 3) ®— 4) 


е 


n\ -3 
24(5) (n— 2) (n— 3) (n 5 w, 
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and other member in common with a third pair, while the fourth pair has an uncommon 
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РЦ 


-3 
member of the second or third, and a distinct member, (3) 4(%— 2) (n — 3) (2n — 4), 


multiplied by the probability, 2’, that among five members drawn without replacement 
from a sample, one is concordant with a second, which is concordant with a third, which is 
concordant with a fourth, which is concordant with the fifth: 


ч) 7 т 9) (n—3) (n—4) 2", 


(xvii) The probability of drawing one pair with one member in common with a second 
pair and other member in common with third pair, while a fourth pair is distinct, 


(47 4(n—2)(n—3) ку) 


2 
multiplied by the probability, (їр)', as in (vi) and (i): 


"(47 (n—2) (n—8) (n—4) (n — 5) (№). 
pairs, any two of which have a member in 


(xviii) The probability of drawing three a қ 
2(n—2) (" : | multiplied by the prob- 


т 
common, while a fourth pair is distinct, (3) 
Ability, (ир), as in (ix) and (i): 
(47 (а-2)(в-3)(а-4)(ар). 
wing three distinct pairs, one of which has a member in 
о 


Q3 [n—2 н — 6), multiplied by the probability, 
Common with the fourth pair, 12(; ( 2 í P7 ‘i · р 


(Ер?) as in (ii) and (iii): 


= А мен 
x) * (n-2)(n—3)(— 4) (n 5) (n — 6) (kp?) 


(xix) The probability of dra 


2 


(xx) Th bability of drawing four pairs with one member in common and other 
5 ө proba 


=$ > 
members distinct, 2 (3) (n —2) (n— 30 £ . 
2 ut replacement, oneis concordant with other four: 


five Members drawn from a samp 
4) 6—2) 0—3)%— 27: 
2 


(ххі ility of drawing three pairs with one member in common, and a fourth 

1 15 , à 

байыд: i "Ue жола? » 3 eat ‚ multiplied by the probability, (0р), as in (v) 
distinct, 4(4) 2(n-2)(n-9)| 9 

and (i): 2 


4), multiplied by the probability, y’, that among 


le witho 


(n— 2) 


27-2 


77$ a) (n3) (0—4) (0—8) (p) 
(€) 
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of dr i and two 
ii) The probability awing two pairs with one member in common, anc 
(xxii e 


esi n—3|, " iplied by the 
ther pairs with a member in common, 3(3) 2(n— »( 2 ) 2(n—5), multip У 
о 2 


probability, (1?)', as in (ii): 
6(;) ag (n—2) (n — 3) (n — 4) (n— 5) (13). 


—6 
: n\-3 (n—2 v ( ) 
(xxiii) The probability of drawing four distinct pairs, ( A ( 2 ) ( 2 | 3 


multiplied by the probability 


of dra Н я е place- 
(phy, wing four concordant pairs without г I 
ment: 


n E" (n—2)(n—3) (n— 4) (n— 5) (n — 6) (n— 7) (pt. 


Collecting these terms together, we get the following results: 


из") = () “о-ө-» (65 + 2и) + (n — 2) (n — 3) (p? 4- 61 + 20) 


г 9) 
+ (n—2)(n—3)(n— 4) 3kp + 4(n— 2) (n — 3) (n — 4) (n — 5) рз), ( 
-2 
Hlp’) = (3) {(61+ 2¢—18kp + 10p*) n? + (6: + Зи — 6р2 — 30L— 108 + 724p — 34p?) n 0) 
x 1 
+ (p+ 9p? + 305? — 12k —4u + 361 +. 12t—72kp)}, ( 
, ту -= 
т) = (: {p+ (n—2) (14h + 12u) 
+(n—2)(n—3) 


(Ep? + 12: + 3614 24a + 6b) 
+(n—2)(n—3) (n — 4) (18kp + 24w + 242 
+ (n—2) (n— 3) (n — 4) (n — 5) (2p3 4- 12lp + 4tp + 6k) 
+ (m—2) (0—8) (@— 4) (n — 5) (а 6) Shp? 11) 
000—2) (0—8) (n—4) (m5) (п—6) (%— туре), 
нар") = Е ЕТЕ nt 
+ (6kp + 24w + 24% 4. 2y — 653 — 96lp — 
+ (Bp? + 121+ 361 + 24а + 6b 
— 24wp — 18y + 758 
+ (14% + 12u — 12 


+4ир+ 2y) 


321p — 8412 + 270kp? — 108p) n? 
—126kp — 216, — 2162: 

+7201р + 24015 + 49612 
— 60¢— 180] — 120a — 306 
+444Ёр + 624w + 6243; + 96up + 52y — 213p? 


—1776lp — 5991р — 9241,2 + 2988hp2— 18874) у, 
+(р—28К—24% 4 21p2 


— 1446 р? + тора) n^ 


+728 + 21614 1442 +366 — 4328р — 576w 
— 576% — Збир — 48y + 180p3 + 1440]p + 480tp + 72042 12) 
— 2160р + 630p4)}, ( 
Considering only the dominant terms, we find ; 
r 13 
OP) ~ 4(E— эу, aA 
“з(') ~ Const. 2-2, P а 
Pa UP) ~ 48b рада, ( 
so that £,>0 and Рь->3 as n>, 
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This agree refore wit e а 1 iv i 
ересен vnu nri пење ene ots,” 

9 ће поп- саве. 

These formulae for the third and fourth moments involve ten parameters, namely. 
D, k,l, t, u, а, b, w, ж and у. In the general case when nothing is known about the pareit 
Population, we have no information about these parameters, except that provided by the 
sample itself. Then we can use sample estimates of these quantities, suitably corrected for 
bias where necessary. These computations may, however, be much too tedious for common 
applications. It is therefore interesting to study the values of these parameters in special 


Cases. 
4. SPECIAL CASE OF INDEPENDENCE 


We first evaluate the ten parameters in the null case of independence, partly because it 
Provides a check on our algebra, and partly because it is useful to have these values in 
extending to the non-null case. 

Let X; (i = 1, 2, ...) be a series of random vari- 


First we collect some preliminary results. 
ables, independently and identically distributed according to some common distribution 


function F(x). Define и = X;— X;. Then we need the values of quantities like 


E (sgn иу» SZN 53), 
Where sgnug;- 1 if 4520, 
0 if ш;-0, 
2-1 if ys 0. 
We с onsider also tq = Ro Y, where Ү,( = 1,2, ..) is a series of independently and identi- 
cally distributed random variables. | 
the probability that both X, and Ху are less than 


Б Obviously, Е(вап и) = 0. Given Ху, y ; 
"t is Бал); and the probability that both X; and X; are greater than X, is (1 — Баур. 


"herefore қ " 
Pr. (Qr, – X) (Ki X)» 9) = | (area + п- rear = + 

Pr. (X, Xa) 057 39 <0 = Б 

80 that E(sgn Sgn ths) = è 

meters in terms of the w’s and the v's. Under the null hypo- 


We can express the ten рата 
pres P nce, we can evaluate all these parameters. As an 


thesis, every и is independent of 
Xample, we evaluate the quantity 
в = E(3(sgn 012880 012 
= 18 (sgn t,» 880 илз) E (sgn 942860 v3) 
+ i(E(sgn тла) E(sgn Ф) + E(sgn шз) E(sgn %3)} 
4 2. 


any v. He 
k by this method: 


+1) беп иза SZD vs +1)) 


+= те 
Іп this жау, we obtain for the null case: | 
pad и= $, TE eom 
ус а=7> у= т 
k= > 72 (16) 
1-4 b5% 
t = i w= 16%, 
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However, this method of evaluating the parameters does not wed — d 
lization to the non-null case. We therefore consider an alternati e m ребе. 
by Hoeffding (1947) to evaluate k, and based on the consideration that in the € га "s 
т кодеин of one set of ranks relative to the other are equally pulis. nervi ren e 
we write P(1423) to indicate the probability that the ranks of the Y's occur in 
when the X's are arranged in order of magnitude. 
(i) р=% 
(4) k= P(123) +4P{(132) + (218)} = 5, 
(ii) Z= P(1234) + $P((1243) + (1324) + (2134)} 
+ $P(2143) + LP((1342) + (1423) + (2314) + (2413) +(3124)} 


-11 
md 


(iv) t= P(1234) + $P{(1243) + (1324) + (2134)} 


+ &P(1342) + (1423) + (1432) + (2314) + (3124) + (3214)} 
=>, 


(у) w= P(123) = 4, 
(vi) a = Р(1234) + }P{(1243) + (1394) + (2134) 
+ ds P((1342) + (1423) + (2314) +(3124)} 


dt. 
72» 


Il 


(vii) b= P(1234) + 4P((1243) + (1324) + (2134) + (2143)} 
7 


72» 


(viii) w= P(12345) + $P{(12354) + (12435) + (13245) + (21345)) 
+196(12453) + (12534) + (13254) + (13425) + (14235) 
+ (21354) + (21435) + (23145) 4 (81245) 
+9 P{(12543) + (14325) + (30145) 
+ 89 Р((13452) + (15284) + (29415) + (41285) 
+ Pi(19524) (14258) + (21453) + (21594) + (23164) + (24186) 
+ (31254) + (31425)) 
"xg Р((13542) + (14359) + (15243 
+ (82415) + (41325) + (42135) 
* ds P(21543) + (32154) 
RAD E (28514) + (Вав) + (25190) «(у 
+ (31524 + (34125) + (4125з)} 
+ goP{(14582) + (15349) 
+ (34215) + (41352) + 
+352 ((24513) + ( 
+ (43152) 


) + (15324) + (24315) 


+ (15423) + (25143) + (25314) + (31542) + (32514) 


(42153) 4. (42315) + (43125)} 
25413) + (34152) 


+ (35124) + (35214) + (41523) + (41532) 


== bbs 
^ 1800" 
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(іх Жаз эле 
) n І penis &P(12384) + (12485) + (13245) + (21345)} 

tie 3) ж Мед тен > позиву 245 
ds (12543) + (13452) + (14825) + (15234) + (2154 li лат 
+ (32154) + (41235)} ан банан ы 

4 2P((13254) + (21354) + (21435)} 

E E + (14253) + (24135) + (31425)} 
ү Р((14523) + (23514) + (25134 52) + (34125 

deg оно Moro тон 

+ 2,Р(24153) + (31524)} | 

+ glpP{(13542) + (14352) + 
+ (31542) + (32415) + (82514) + (34152) 
+ (42135) + (42153)} 


+ d P((25314) + (35142) + (4135 


(15243) + (15824) + (24315) + (24513) + (25148) 
+ (35124) + (41325) + (41523) 


2) + (42513)} 


12435) + (13245) + (21345) 


(х) y P(12345) - P (02354) + ( 
12543) + (13425) + (14235) + (14325) + (23145) 


+ЁР{(12453) + (12584) + ( 
+ (81245) + (82145)) 
+ 1 (13254) + (13482) + ( 13524) + (13542) + (14253) + (14352) 
* (14523) + (14582) + (15284) + (15243) + (19822) | 
+ (15342) + (15423) + (15432) + (21354) + (21435) + (23415) 
+ (24135) + (24315) + (31425) + (32415) + (34125) + (3421 5) 
+ (41235) + (41325) + (42135) + (42315) + (43125) + (43215)} 
wl = 75. 
hen we substitut : i 
n e these values for the ten paramet 9 
esult as (7) and (8). p ers in (10) and (12), we get the same 
OF NORMAL CORRELATION 
arameters in the case when the 
parent i 
from (5) population 


5, SPECIAL CASE 


f the ten p 


We n . 
ow consider the values 0 
ase we already have, 


ің bi * 
variate normal. For this © 


_1 ПЗЕ, | 
pate p- 
(6) we have 


Fur 
ther, by comparing (2) and 
4 
- 20 tes (sin) 


ward 10 = sin -4 in*ip* 
e same methods used to deduce the 


valuated exactly by th 


One 
а, BE parameter can be e 
3-1,6, 
ӘЛІ 4$. ед bh лук 
mre cn +?) Авц). (18) 
their evaluation involves the same 


mined exactly; 
in evaluating the variance of 


ot be deter 

nted out by Kendall (1949) 

efficient 7; exactly under normal correlation. Howeve 
у easily, because from the results a 


an be obtained fai 
f five need be considered. I have shown in 


Th 
Ype € other parameters cann 
E difficulty as that poi 
man's rank correlation 00 


estj: 

m 

6 ciel of these parameters с 
evious section we 866 that only samples О: 


420) Moments of the rank correlation coefficient т 


5 ; amatic od of sampling may be used in such 
ашнен rS e amete cr пастар ас иаа distribution of пете 
jeu "X = Cotte fain a bivariate normal population with correlation p = de of 
222 of three parameters (for the case p = 1/42) and the peg е 
the other seven parameters, obtained from the above sample results, are as follows: 

р = 07500, а = 0-3470, 
k = 0-5770, b = 0-3485, (9) 
“ = 0-4431, w = 03451,» 
t = 04543, = = 0-3363, 
l= 0-4414, у = 0-3642, 
Substituting these values in (10) and (1 


2), we get 


n\ -2 | (20) 
ne (5 {—0-013625п 40-0007» — 0-0427}, 

re 4 21) 
= (5 (000126151 — 0-043162 + 05902722 — 1-9285n + 1-70314]. ( 


These formulae are not entirely s 


"Instration of 
atisfactory, and are given here onl y as an illustratio! 
the method. Sampling 


i : ; responding 
experiments to improve these results and to obtain correspo! 
formulae for a range of values of p have been put in hand. 


nd 
However, the argument of this paper shows that the third and fourth moments of ( depè 


a » prob- 
only on ten parameters, and that these parameters may be evaluated by finding the P7 
ability distribution of rankings of five from correlated populations. 


+ Kendall 
In conclusion, the author would like to express his indebtedness to Prof. М. С. Kend 


ноћ 0 
for suggesting the problem and for many valuable suggestions during the preparation 
this paper. 


[Note added in proof.] With regard to the argument of section 2 on the upper bound 9 
variance of t, it has been objected i d to Dr Hartley for pointing this out to ur be 
the transfer of units between the quantities designated. P, and P, is not always possible; i.e. Mu 
2 pe о find a ranking in which the Ps have the & 
values. This objection may be met as follows: 

The problem is esse 


f the 
that 
ib 


and certain other constraints on the Р% by virtue 


there is a ranking corresponding to the alte: 
satisfies all the constraints ¢ 


" . . 2 sol 

у maximization procedure, In section 2 only, show? 
these constraints are taken into account during the maximization procedure. However, it 18 сша) 
that the maximizing set of P/s does satisfy all the constraints because it corresponds to ап Р 
ranking, i.e. the canonical ranking, 


pe 
т "ence А -- . | © 
Now, a quantity which is maximi Ject to only a part of the set, of all constraints mU ined 
greater or equal to the maximum sub; КШ ae cris nm Л 
satisfies all constraints, then it mus а My , 5 


t be Ри: 1 subject to all constraints. 
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99-9 AND 01% POINTS OF THE x? DISTRIBUTION 
By T. LEWIS, Statistical Advisory Unit, Ministry of Supply 


v. In answer to a special inquiry the lower 0-1 % points of the x? distribution have b 

aie 2. correct (о 8 decimal places for degrees of freedom т = 1(1) 30(10) 100 T20 nie 

o the upper 0-1 9/, points for т = 40(10) 100, 120. The resulting values вери digi Р 
2 1 


ma ee ч s 
ay be of value to other statisticians are given in Tables 1 and 2 below. 


Table 1. Lower 0-1 96 points of X? 


— 
Degrees 
овтеөв of ; " Degrees of А * Degrees of 
freedom Value of Y ето Value of ү i npud. Value of y? 
1 а | o 
0-0000 0157 16 3-9416 2784 40 a 
5 0-0020 0100 17 1-4160 9272 17-9164 2654 
: 0-0242 9759 18 4.9048 4881 50 wo 
4 0-0908 0404 19 5-4068 1602 24-6739 0527 
9 0-2102 1260 20 5:0210 4075 60 31-7383 4159 
6 0.3810 6676 21 6-4466 7656 70 39:0: 
ке = -03 LÀ 
: 0-5984 9375 22 6-9829 6844 63 7739 
0:8571 0483 23 7-5292 3977 80 46-51 
9 1-1519 4955 24 8-0848 8158 ЕР 
w 1.4787 4346 25 8:6493 4363 90 54-1559 4356 
11 — 2 9.2221 2682 B 
1-8338 5266 26 9:22 100 61-9179 392 
Ы 2.2142 0932 27 9.8027 7692 191982) 
; 2.6172 1815 28 10-3908 7912 120 77-75 
14 ч 29 10.9860 5349 ааа 
19 3.4826 8447 30 11:5879 5105 
Table 2. Upper 01 % points of X? 
) 
сев of 
Degrees of лае of д? Degrees 0 Value of x? 
freedom ve x freedom 
73-4019 5752 80 124-8392 2402 
| ей 15.6608 1519 90 137-2083 5413 
60 99-6072 3307 100 149-4492 5278 
| A 112-3169 3185 120 173-6174 3646 
CULATION 


METHOD OF CAL 


of freedom, т = тапа? = 
degrees of freedom, and 7 


Jy. If, then, 29 ів the 100. % point 


2. We wri 

- We write n for degrees ет à 
oft ! "itt «DP» у 
the X? distribution forn | is written for P(r +1) whether 7 is 


inte 
gral or not, E" Г — Р, 
T(m) J v 
flv) = 9 


ог; 
> M other words, 


422, 99-9 and 0-1 % points of the x? distribution 
2 ym-i 
where Ке) = + t жаты) Ре” [n even], a) 
={ Bel renr + "т Ре [вода]. (2) 
ла = | ја“ Је stet wo Pe” oda 


The exponential or quasi-exponential sum in curly brackets, i.e. /(0) + Ре", we write as 
8 „лапа the term v"3/(m — 1)! as 7,1. 

3. Each required value of v(m, P) was obtained by computing f (v) directly for an approxi- 
mate value of v, v = v, say, and improving v, iteratively to v, + до by means of the formula 


бо = A aà? + ВАЗ, (3) 
where A= Se) 1 | 
Та = 
1 Е беч Tas) (4) 
a = = | 4 
2 Шел 22.6 
В = 202 _1 (та = Тә + тз) 
6 Та V=U9 Ў 


This formula for до follows from the Taylor expansion of v as a function of the small 
quantity ш-е-” f(v), thus: 
v = vy when w = ef (vp), v = vo + д0 when w = 0, hence 


= dv ES d?» d? 
ь--езде( ) tHe) (5) inta (i5), 
But w = e S, ,— P, hence 
dw 


dv gius (8,2 x 85,4) mice Т cds 
dw 

ал” e (Ina — ие), 

Bw 

ust е” (Та - 2T a аи m а); 


whence the values of the inverse derivatives 4%)4ш” and the relations (3) and (4) readily 
follow. 
OUTLINE OF PROCEDURE 


4. For the first approximation v,, use was made of the well-known formula 


3 Яс Н 1 
A (v[m) — 1 mt / ar 


dueto Wilson & Hilferty (1931), in which the 100P Фо value of visrelated t thecorrespon 
unit normal deviate v. The terms Т, = vir! were then gee othe жу 
from the greatest term Ту; the extra term puted in succession, 


2 © 

т = | v Í 

с 67 eie 
7T о) dt 


required for odd т was obtained separate] 


ding 


y from Shep 5 У ple of 
| pard’s (1939) 12- 1 ta 
ett? | aM Ғана Ha антуна. ( ) 12-decima. 

т 
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The Р 

м ^ ша Т, were then added together to give the sum S,,_, and also the last few parti 

СТЕ Using the National Bureau of Standards (1951) 15- and 18 dech = 
8-decima 


tables A ae : 
es of e+7, e"o was obtained to the requisite number of places; the difference 


Ле) T NP Реч 
was the А 
s then taken, and the correction ðv computed from the relations (3) and (4). 


For the ђ : . 
T ді the next iteration, with v = vy + Ov, no computation of terms Т. was necessary to 
ain the new sum &,, (v, + 52): this being caleulated with ease from the first few terms of 


the Taylor series 
8,-1(% +00) Р Sinz 


th Я 
ле S, on the right-hand side being the values for v = Vo 
e Taylor series 


m 
m-1(vo + др) was obtained from th 
Tot 400 Tm- t ee 


qi T до mos 
s required, the sum 8, (vy + àv + 82") being 
бо + до" in the Taylor series (5). 


1 + до Sin-2 i jov Sm-3 + 95 Sn—a Tes (5) 
already recorded. In the same way 


ation at most wa: 


After this, one further iter 
ceumulated correction 


ev 

aluated as above by using the à 
ACCURACY OF METHOD 

ue for which the ‘discrepancy’ 

differs from 0 by a small amount 

dv of paras. Запа 4), is given by 


v is determined as the val 
Tf in fact Sm-1 -Ре 
osite in sign to the 


z 
fe 5. In the above process, 
А Y = S, ,— Pe" comes out to zero. 
› the error ó,vin v (80 written because opP 
ын 8($,.1 — Ре) == (S, — Pe") дуо 

~ (Sn-2— Sma) Que — T 0,7, 

дуо~ — e|Tm-x 

that the pivotal term T, is calculated to N significant figures, 
t denoted by % 80 that 10-0797, >> 10-У7,. Clearly Т, and 
| he few terms T. of like order (ie. with the same number of figures before the decimal point) 
ivo errors of order u, the terms of next lower order (with one Jess figure before the decimal 
Point) have errors of order 10-1ш, and so оп; the error in Pe is negligible. Hence e is at most 


Of order 10-007. 
When Р = 0-001, T, 
54 decimal places. 

hen Р = 0-999, 
he 
nce | 8, 2 | is of order 


T Ў 
js v is certainly correct to № 
Ccordingly the terms Т were © 


107 


Т С 
1 3 estimate е, let us suppose 
init in the last place being 


=T, hence DU | is of order 10785, and v is certainly correct to 


= РС ~ 105, 


ТТ d 
10-9, [Ti 


—4 decimal places. 
alculated to 12 significant figures for P = 0-999 and to 


re. TOON: 


à significant figures for P = 0-001. 
CHECKING OF FINAL RESULTS BY DIFFERENCING 
is 8. Tt can readily be shown that (Р), the 100P % point of x? for n degrees of freedom, 
ОЁ the form meal T (8) 


Whe 
те a,b,c, ... are functions of P. 


494. 99-9 and 0-1 %, points of the x? distribution 


For this reason, direct differencing of the results уң(0-999) against n for n = 1(1) 30 and 
again for n = 30(10) 100, and of y2(0-001) for n = 40(10) 100, was not practicable. An 
attempt to base the differencing check on the residuals 


{А2(Р) — corresponding Wilson-Hilferty approximation] 
also failed. In the end it was found possible to check all the results for n > 14 by a differencing 
method based оп the accurate evaluation of series (6) as far as the term in тї, and to check 


the results for » < 14 by means of another series. Details of these series are given in paras. 7 
and 8 respectively, and the steps in the checking procedure are outlined in para. 9. 


7. Series for x? in terms of the corresponding normal deviate. If x is the 100P 9% normal 


deviate, so that i У 1 
jon | е = Р = ==— etii 
osi]. M % 


then the coefficients a,b,c, ... in (6) can be expressed as polynomials in т. Developing the 


series for convenience in terms of m= in instead of n, we may write 


№ = 2т-+ т + (bya? +b) + (cov? + суа) m + (doa +422 +4) m... (7) 


where а, bo, by, со, ... are constants. The values of these constants can be determined БУ 


equating moments about zero for the left- and right-hand sides of (7), bearing in mind that 


the rth moment of y? is ?^m(m 4-1) ... (т--ғ-1) and that of т, 0 when r is odd and 


(7—1) (r—3)...3.1 when r is even. For example, if we take (7) for convenience in the for™ 


№ — 2m = az mi + (byt? + b,) + (m3) 
and equate Ist, 2nd and 3rd moments each side, we get 


2m — 2m = by +b, + О(т-1), 
—4m.2m+4m2 = ат + O(1), 


8m(m + 1) (m + 2) — 6m. 4m(m.4- 1) + 12m? 2m — 8m3 = За (3b -- b,) m+ O(1) 
0 1 ? 


3. Clearly the sign of ay is +, since 


02 = я 
XP = 2m + arm! + (2—1) + 


4m(m + 1) 


whence b, +b, = 0, ag = 4, 3b) +b, = 0; 
thus series (7) begins X> 2m when 27 `? 


In this manner the expansion may be take: 
n to an i г 
сідік У required number of terms; to ord? 


№ = 2m 2am? + $(а#— 1) + (аа — Tx) т-# 


— 05(824 + Tx? — 16) m-.. 


1 
19449 (925 + 25623 — 4337) m- (8) 
In particular, when P=0001, x= евра, айдар 5623 — 4332) m? + .... 
and when P=0-999, + — , А 
hence, from (8), 3-09023, 23062; 


2 „(0-001) = 2 + 6:18046, 461247 4. 5.69969 0 
‚ 04708 


+0-43770, 32003m-1 — 9.89105 59174m-1 
+ 0-45024 Ж | t 
› 93635m- г 
= 2т+щті tutu jm 1 
Ж. ~1m- p 
and Хап(0:999) = 2m — шті + us y а ete 


М вау; 
m^. u oum 


иат... 


жуа Т. Lewis 
- Check series for уң a 425 
when P j or x2(P) when nis small and P nearly 1 і 
is near to unity, so that v/m is small, ои and (2) that 
om рт+1 


(Ре 
је аа" 


_ (s us v 
m! т+1 eas): 
hence E 
ут = п-т ) 
m+ У 

ог 

а јашу“ ез Me RN қы 

mal C з 


on ex 
LS pandi е : Р Е 
ng the exponential and binomial, multiplying together, and putting 2 
Qm Ang 20 = qa. 


this gives 
№ > 2 919 
2[т (1 -Pypmf1+ X + (г 
n2. 2(n+2) (n+4) 
2) n—2 


+ Риа n 
6(m + 2) (n+ 4) (п 6) n. 2 
qi — 20п%— 134n — 64 
@+2)* (n 
(n — 2) (n — 13612 — 916n — 704) 
TETTE) | ms (12) 


en series (8) and the computed values of 
over the higher range of values of 
the other hand, the algebraic тық 
have been laborious. The following 


+5 (a 
24(n + 2) (n4- 4) (n+ 6) (n4 8) 


2)9 


4n р | 
120+ 2) (n4) (n+ 6) (а + 8)(®+ 10) 

The comparison betwe 
asis for а differencing check 
ay 15 and 30. On 
(8) would 


9. | 
x de hecking procedure. 
т, |. an adequate b 
Dutation > 30, but not for n between 8 
ед. of further polynomial coefficients in 

5 was therefore adopted: 
avail + equations (9), (10) and (11) and the set 
able for n = 40(10) 100, the quan 
0-001) + (0-999)] = 
2(0-999)] —umi— 
and recomputed 
—4 in (9) were then 


of paired results y2(0-001), у(0-999) 
tities 

= Rl 2m — 00 — шат, 

һе 10200001) 72 


differencing: 
pu m3, т 


пи 4m — пут, 


where necessary. Estimates 


Wer 
were calculated, checked by 
obtained by fitting the seven 


ӘЙ Дй 
quantit; ú, of the coefficients ofm 
ities g, to the relationship 


on, 4 S al —4. 
In = 4 Qm Tuam 


2.4 QT 


simil 
arly, à, , d 4, U4 Were obtained from the ћу 
14(1) 30 were по 


т g 
he values of x3(0°999) for ^ = 


w checked by differencing the residuals 


Ppug—- 1 m4. 4 m ?— атъ. ti_gm-*). 
кей from equation (1 2) by evaluating 


y2— (2m — 47 
the two sides of the equation and 


(0-999) were сћес 


ues of Xn 
cy between 


2 the discrep” 
ptained. 


Fo 
oy OT = 11) 14 tbe và 
i ch given value of X 


ffer А 
encing the quantities 50 о 
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STANDARD APPROXIMATIONS TO X? WHEN 7 IS LARGE 


10. It is of interest to compare the two most familiar approximations for y? with the 
exact series obtained above. 


I. J(2x?) distributed approximately normally with mean V/(2n — 1) and variance 1 
(Fisher). 
This gives № n x2ni + 1(22—1) + O(n). 


П. #02/т) distributed approximately normally with mean 1 — 2/(9%) and variance 2/(97) 
(Wilson-Hilferty). 
This gi 25/2 
SEVES imeem 2nd 5002 уфа? (a9 — 62) n= + O(n-), 

ПІ. Exact series (equation (8)): 


2 
X = та 2n + 2 (a? +3 (23 — Ta) п + O(n-), 
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TABLES OF SYMMETRIC FUNCTIONS. PART IV 


By Е. N. DAVID AND M. G. KENDALL 


& Kendall, 1949, 1951) we have given for function: 
, ions up 


to and i ч 

the iem үт weight 12, tables of the monomial symmetric functi 
art functions (S) and vice versa, tables of the unitary e a 00 е terms of 

ables of the homogeneous product sums ( H)in mt n in terms of 

a definitive set of tables of symmetric unà ТШШЩ 

cation we shall complete the set = tit a 

valent to the US-SU “tables ih 


l. In re 
n recent publications (David 


1 

p л ce versa, and t 

Dreset the af ed task of compiling 

the USSU H-H M tables. Ina further publi 
tables, the final tables HS-SH being equi 


Suita 
"E changes of sign. 
A Man 7 
simplest i y methods can be used for compiling and checking the pre 
S 
st is to start from the relationship a One ofthe 
n! 
sf sg s sp. 


al, = Dimon роті 
no “үң Эл. krem! To! mo 


d into monomial symmetric functions using Part I of 
i our 


ely, we have 


3,22 (= — pnb- жа i 
PALA er! 
al symmetric function and express it in terms of the S-functio 
i - ns 
ms of h gives the required result. The whole of 
tor to build up each table 


ution for s in ter 

by using MacMahon's D opera 

ples MH were checked in the same way, first by using the 
relationship which result. For the MA 


ations of 
e sum of the coefficients is zero. 
the uses of the U-functions in 


analogous way and we quote 
MacMahon can be stated as 


Th 
е S-functions can be turne 


table 
S a 

and the result follows. Convers 
Has Раз... МЕ. 


Жом. ake the monomi 
B Substit 
а HM was checked 
Operator ely. The converse ta 

and then by solving the simple equ 


able: 
8 we have the additional check that th 
o Parts TI and ІП we mentioned 


H-fanctions can be used in an 


3. 
In the introduction t 
A distribution theorem due to 


Prob] 
; WEE of distribution. The 
о len only one example. 
T 
һе number of ways 9 
pty, 


Spe сі А 
ification (17), по box being еш 
дәт Pr Pa т" Pr) 


of specification (рара --- Pr) into boxes of 


f distributing objects 
о the coefficient of 


is equal t 


© development of the function 
(ht + Te Qu 
4 method of attack is the 


Кое we come to use this th і found that the easies 

i e n of the powers and products 0 D terms of the M-functions. This 

e ‚. 

4 of the simplest theorems 
* In order to save 820% 


Hi 

Wsch’ 

псы s law of symmet! w. 
On (7, v, ... 7j) in the expansion of iy, Ms 


ich states t 
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tric function (lle --- lm) in the expansion of h,,h,, ... hp, This enables each table to м 
tme as a triangle and the only difficulty in putting the two parts together has been at е 
ны са Inorder to express ће M -functionsin terms of the H-functions we read horizonta: у 
шаб the diagonal in bold type is reached and then continue vertically downwards to the 
edge of the table. Thus for w = 5 we have 


(312) = 343 — 131, I + 10757, + 127514 — Shaħo — 9%, А, + Shs. 


In order to express H-functions in terms of the J/-functions we read vertically until € 
diagonal in bold type is reached and then continue horizontally to the right to the edge о 
the table. Thus, again for w = 5, 


hgh} = 20(15) + 13(213) + (221) + 7(312) + 4(32) + 3(41) + (5). 


The two diagonal figures are separated by a bold horizontal rule. 
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Table 4.2 


Table 4.3 


Table 4.5 


WHS | ње hh? 


hèh, hyh? Аһ hih, Ms 


(213) 3 8 ц 1 
74 17 

(231) u g 1 
3-4 14 


10 5 

7 4 

Е a 

ey od. mod 

-2 3 
II 

2 

5 9 


PS & кыш шы 7% 
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Table 4.6 
| > = dto hs ы hè hà ііі қа h? қы fln М) 
a 720 збо 180 до 120 60 30 20 15 6 1 
1 
| (21) 192 102 54 72 38 21 14 її 5 s 
5 26 
(Gh) = 33 42 24 14 10 8 à 1 
5 -33 4t 
| ез | -1 6 10 = jji е ~ | 3 : ! 
(31°) ^ E ч aS s 4 Ж ! 
„ 721 2 -4 
(321) а 8 6 5 3 1 
-6 34 -48 в -J 6r 
(419) = 4 4 = 1 
-3 16 —20 4755 21 15 
(з) = E == 3 2 1 
| 1 -6 9 -1 6 15 3 6 
E 2 1 
(42) zi қ -6 w == 49 3 14 
2 12 9 "n 
(51) а. => 14 „а ая =й == 3 6 тї 
(6) | 6 6 6 -= 
-1 6 -9 2 = 12 -3 = -6 6 
Table 4.7 
ы 220 — а P " 
wey | AS hh? heh? hgh дай „иһ? hsh? hêh hh шаһ “halts hsh? қы hih ds 
(ау) | 5949 2520 1269 өс 840 420 шо ме 210 jg 09 ~ $ X 
(215) ® qua» 1600) 1509 48e 259 23° фо mns 70 55 Br um = 3 
(әз) -6 37 38 207 27° ша. = + "AE E = - 
1 —— 
- 10 op 17 ao w $7 51 37 51 39 13 15 9 4 1 
2*1) 
) 4 27 754 30 ов 114 62 46 73 39 15 21 п 5 1 
319) d 
$ 3 a == 28 67 39 30 43 25 11 14 8 4 1 
(321: я 
| 12 78 -143 а -70 3995 җ 19 jg 96 % ° 6 з Ж 
(BO х си a j; ee ^ш за dio C в 5 3 2 
" 2 
en) «=й g 7% го -60 1 б м ә am 7 ж, * 
= 22 
(41°) ж) $ =з 55 14 14 ўа 6 8 Р Жой 
-з 4 
"M SS b ш; 44 xw 
—26 19 
7 3 т 
18 
—12 


28 
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Table 4.8 
w=8 (i) h’ hh hth, heh? ћу hh? hhh? Mmhèh hèh? hh; 
| 
а» 40320 20160 10080 5040 2520 6720 3360 1680 1120 560 
— А 
(219) E пы 5400 2790 1440 3720 1920 990 680 350 
2892 2 
(2*1) 46 =e I 1548 828 2040 1092 584 412 220 
(212) FS LP cd E 480 1110 618 345 248 139 
25) 282 600 4 
( қ Ж = = Е 348 204 148 88 
(315) 1520 820 
6 —43 94 = 63 6 39 ыы = is 
(321°) —46 
= 148 -538 äs ЕР? К дз 260 194 108 
(321) 15 
E 4 116 69 
12 - 
(3:12) = м а И е " 
6 E = 92 54 
4 193 yo 6 44 — 165 102 66 
(372) 
—3 24 — 61 - —35 
(41%) -5 30 —79 5% 2 122 94 -8. =33 35 
(да 12 —90 209 = = ЕН а) p 735 19 
(42 -3 2 —62 6 ES 2 —298 192 99 -48 
(431) E 46 — —108 56 з 2% бө  -64  -23 14 
(49 І -8 20 — —16 5 = 176 —107 —74 31 
-32 20 = 
(513) 4 —29 64 - a Қ 
(521) -6 40  —110 { ат FUE 58 2 -16 
(53 2 6 о -n = ^a 
(бт? -3 22 m E E 16 – 67 En ES Т 
1 ) 49 3 Е: 54 27-23 
2 2 —16 41 Е 71 = = 
6 43 22 Ir 
71) 2 -15 34 —23 2 14 -56 48 15 =14 
(8) -i 8 = -20 10 c 38 mm 31 19 - 
32 —24 =12 8 
w=8 (ii) hihi hihih? Аһ? ЛЗ hë 
ge А m а hy hd? қық hsh, heh? AA PEN т 
» s 
а“ 1680 840 = —€— 
420 280 
QS ово Fm oS во s a s | 
ara 90 3 173 122 36 ТТ ы 43 І 
an 354 19 пл 80 25 52 80 30 32 =] % 1 
k 204 120 72 52 19 E 3 2% a 13 5 1 
31 500 265 10 4. 1 
(321°) 286 160 % % 2 136 71 26 
= 162 96 57 43 16 55 47 19 = г E 1 
(371 ) 126 76 45 % 52 31 T 22 12 5 1 
(32. 70 45 29 >м ae 8 26 E Hu % 4 : 
D 209. 115 6; Е 1 4 
(9 - 3 47 16 73 3j E 9 6 3 Е 
зуб в a "s 5 21 її 5 I 
—76 211 is “ 25 11 14 8 т 
ж 26 4 
(427) ES E — E 9 25 16 
die 36 8 9 6 3 1 
431 т 
40 —121 32 = 8 ә 13 7 
а») Е : ы 4 i 
-8 28 -12 =24 5 Ы 6 4 
(519) то 4 3 2 1 
-26 бо =g = 34 19 
4 28 8 
(521) B * 26 = 7 $ š 
— 195 28 42 
54 = — 6 
(53) к, Р: 76 5 3 8 у 
~ 39 - = 4 
(612) 4 13 XY — 4 3 2 т 
3 2 
21 — 50. 14 23 " 3 
(62) қ 4$. ar 29 == m 4 3 £ 
14 42 „ж эу 21 
(71) э * 14 -28 3 2 i 
5 38 -8 e 8 ті 720 
(8) i 4 15 К E 
8  -24 8 723 Я = = 
16 == Ej TS 8 15 + 
16 -8 -g 
ET == 5 


то ~ m ы 
TO GD Пелла? alts? ty Just тық hêh 
Teg === "-— 
б 
(5) 7560 3780 2520 1260 630 
2318) 4515 2310 1575 805 420 
зата) | 2670 1405 2o МЕ "ac 
21) 1506 852 боб 186 
318 912 516 372 12: 
Gir) 2115 1110 785 230 
(32315) 1229 669 480 154 
(325 710 403 208 102 
(3212) 408 243 181 67 
(3521 548 313 240 86 
(3° 313 188 145 56 
(415 135 87 (9 30 
(6 840 455 335 110 
4215) | 479 272 206 7A 
(дәй 520 
4231) 163 125 255 49 
“344 270 
(3:5 У хот бо 42 
7344 21 277 
(432) 7 77 ES 27 
184 — x 2 
(т) 4 154 131 II: ES 
то, - - 8 39 
(519) оер Ls 
= - 6 18 
(5211) 119 —71 7 3 
E -113 755 
(528) 317 200 200 
94 - _ 42 19 
(өзі) 4 -80 -54 
i — -1 4 Ж 
(54) по —143 
== ай: © 
(612) жо 9 73 
= Вт 7210 
ar) 99 61 56 3 
22 
(63) has = | 3 
= E -9 
(зу бо 45 45 36 
з Brew = 5 5 
(| “= = aga 79 
5180, таз -13 
(9) 35 43 
36 -27; -27 18 9 


DERE 


o24 1512 756 
3888 Du de 
1208 634 233 
738 402 2 
444 e 144 
4044 58 5а 
$40. 339 1 
370 29 , 
3% 129 78 
302 172 07 
$a. 3 3 
78 51 33 
501 266 141 
287 161 99 
ы 9: 5 
ту 77 и 
71 46: 59 
48 32 21 
209 215 63 
35 бв 49 
228 
-84 А 26 
21 -68 38 
42 —123 28 
ате am 59 
eke io 56 
ag cap BA 
mug 4: ^* 
24-37 16 
ET. 48 -23 
"E № 
эга NE 


hh? һы № 


-9 


DET 


Table 4.9 
w9()| m? һы hath! 
SO hh) hèh? hth dt dut 2 
i i s i a M E Sut dust hhh? hh? hèh? Ash] dy  hihè 
62880 18 72 
(19) 3 181440 90720 45360 22680 60480 2. 
= 480 30240 15120 7560 10080 5040 1680 15120 
т 93240 47880 24570 2600 
(217) E Es 4570 1 32760 16800 8610 4410 5850 зого 1050 88го 
(221° 25260 13320 7020 176. 2 
) du p aE 7640 9300 4900 2550 3420 1800 ббо 5070 
(2313 __7227 3924 9450 130 278 = 
у -20 170 -45 517 ИУ 75: зау МАЙЫ ШОШ” ја Еа 
(2*1) 2202 5040 2820 1584 $94 m 
& =a tae 564 586-222, Wo: gode ас. эбе 
(31°) 12840 6840 3630 1920 2600 137 
7 -57 152 — 146 36 52 A sm SIME AMA 
(3215) 3764 2064 1128 1508 822 24 2 
-30 250 -687 686 -177 - 228 1039 Qu on 
(32315) i173 0666 868 493 20 
30 -258 ти -789 222 231 —1110 1297 207 3258 
G2) 396 496 205 132 
-4 36 —111 134 —50 -309 153 -202 50 * > бер 
(3:13) 664 375 162 
то -84 231 —226 5+ $1 —374 392  —48 149 357 
(3721) 227 105 520 
—12 105 -—306 gas -82 -9 494 -597 78 -іо2 327 
(3°) 55 210 
1 -9 27 -29 6 4-4 63 -6 21 -45 10 
(gr? 1545 
d -6 —131 12 -33 -45 19 -195 27 -(6 78 -6 42 
421?) -474 132 154 -099 740 -п2 240 —304 24 —143 
344 -110 -9 451 —536 98 -154 222 —1$ 87 
283 -74 —100 463 -477 66 —159 216 —18 от 
— 188 бо so -250 324 -бо 96 -157 15 —44 
—76 22 26 -122 124 -19 52 si 3 -26 
—107 2 38 -165 160 —23 56  —70 6 -36 
207 -82 — 94 434 -47 74 -ISI 204 -18 as 
Kb а ee 58 -5і Bí es 
-150 36 а 250 275 -34 104 —142 15 -40 
бо  —19 -18 go —104 10 —3$ 47 =а 18 
87 -22 731 135 —139 18 -46 бо -6 30 
— 160 44 28 -229 208  —38 70 —127 15 —45 
59 -IS -18 93 -117 15 30 72 -із 15 
–67 18 24 —105 106 -16 37 = 3 -24 
65 -23 -1 Во —103 22 725 а а 16 
46 -и -17 77 -% 0 729 35 -3 17 
= — 
lh, hh? hih hih 38 


504 252 84 


357 182 63 
246 129 47 
166 99 35 
108 62 26 
220 118 42 
із: 81 з 
97 55 23 
er 37 27 
86 48 20 
53 д^ „За 
27 18 10 
136 71 26 
ge ado ОУ 
52 31 14 
44 26 12 
26 17 9 
20 13 7 
ма 307 25 
1752 5 
25 16 8 
20 13 7 
8 6 4 
D ло,” 
730 
ал 
= 88 
45 4 
== 
15 —36 27 
„Каз сер 
16 —32 9 
Fee © 
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Table 4.10 
w=ro (i) hy? hih’ hè ht h? h' hth? hy hh’ Аһ. hh, hs? 
m 3628800 1814400 907200 453600 226800 113400 604800 02400 151200 
(11) Ч 4 4 3024 
(21*) M 927369 473760 241920 123480 бзооо 322560 164640 84000 
(210) 247320 129060 67320 35100 171360 89400 46020 
28 — 259 834 
(219 68868 36756 19620 90720 48420 25848 
=35 330 — 1090 1476 
(2*1) 20100 11010 47880 26160 14310 
15 —145 496 — 708 371 
(29 6210 25200 14100 7920 
=f 10 —36 56 -35 6 
(317) $ 122640 64680 34020 
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MISCELLANEA 


Som i 
e procedures for comparing Poisson processes or populations 


By ALLAN BIRNBAUM,* Columbia University 


1. INTRODUCTION 
two Poisson processes arises in a varioty of applications. For example, if 
loth, paper. or wire is inspected continuously for flaws, the а 
often be assumed to have the probability distribution 
weds 


ey (к= 0,1,2, 


whero А; 
hero A is the mean number of faults рег unit length. Hence the problem 
roblem of compa ring the parameters Аһ 
nter is used to record emissions from a radi 
issions recorded during any duration of time # may 
‚ where А is the mean number of emissions 
lem of comparing the emission rates of two such substances 
ho problem of comparing the parameters 


Ti 3 
R + problem of comparing 
2 казыш substance like ¢ 
aults observed in any length ¢ may 

р(х, A0 = 

of comparing two kinds of 
A, of two Poisson processes. 
ioactive substance under 


канык if a Geiger cow 
Je assume vn ant conditions, t! 
recorded = "un havo the probabi 
Observed hn unit interval of time. 1 

indor comparable conditions 


40 В 
2 Of two Poisson processes. 
SIGNS 


EXPERIMENTAL DE: 
z such problems is the simultaneous 


2. SOME 
ich will be convenient for many | 
uch processes characterized by distributions 

curs, i.e. until either ту = 1 


A si 
Ве procedure wh grin 
Dhe, “ee of the two processes. done bur dem 
or 00), Pla Аз, t) are observed simu aneoust) | ge = ; 
of зе 1; fa књ Д ihe above examples, 2n ‘even ‘ Ew ip ha i уз dee eos 
& count, (The icnultanaoUs occurrence 0 two or more evonts , either in the same or in ifferent 
Tocessos, has pr simu pens pence this possibility may be ignored.) Then the probability that tho 
frst ovent el d D pote : ion that between t and t+At units of inspection aro 
"curs in the firs! 


Perfor ss, on the condit: 
med before it occurs, 18 


he first ‘event’ oc 


$’ consists of 


f pis the probability that the 
17 which depends on Арда 
tinued, and if we set у; = 

eriment is seen to provide 


independent of t. Hence i 
à 1+ (АА) 


this 1 
E луда + Ae) L 
i ection 15 con 


Who; 
TO с 
fi ? € approaches 0 wi th à 
the first process, 
i taneous insP' 
sh simul ghon 


ть ey, А 
only bed is observed in 

"Onc rough their ratio у = ^2/^ T Е 

Ccording as tho ith event is observed m 


80 
Чаопео of independent Born 


Pr {yi = ре} 
wl pre zt? (ie 12) 

1010 р = 4 isons of the two processes in 
po, 2 = Џа y) ] to to express comparisons ses 
we many (Б ^ i all) purposes it is арр Us a anii g y may then be answered by use of the 

A Statistical ques ст observations 4 . 
pA А against the alternatives А,>А»› with 
+ Тһе requirement may be met 


ms 
of the ratio y= ЛА E 
for dealt 

equire ot ША ade 
иһ power. Ti 5 whe 74 d uiroment 
tions Yi f the hypothe Age 
ions Yi — 0:95 when Р = $ A 
У rmal distributio 


Vane 
ario 
us 8 
15 mothods available ng Wi 
Meus a 
atives p» b 


Py 
Signi oblem 1. Suppose that it is T 


ificance level а = 0:05. and у? 
n for approxi- 


а 
тањи based on the observ® d power ftl 
ifi ; = 0:05, am ‘een ables of the 10 d 
та 1 level % ыс o bino: mial stribution Ga eee ding critical value т, WO have 
ation) . Using tables 0 hi eof? such that, for 80) 
we find the smallest У Е zi 2406 
Pe Ў ш>та\Р^*]` 
jel 
ES 9 
апд P У y> ral? {|>0 р 
i=l the National Foundation for Infantile 
* Th; writer was © ployed po cal Research. The writer is indebted 
Paral his work was begun while the Y apport of the [NOT rand i particular for suggesting 
to th Ysis and was comple with the 8 ot жы draft of this рар 
the © referee £r helpful comments on Eos 


Method used in §3+ 
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i E i aik's le of с and у for various sample sizes 
ies xi i /hich was used to obtain Patnaik's tab ж о wi: 
ее эрага а small values of v*. A corrected table of c and v was giv en by B. А. БН Ыг 
оон only to the nearest 0-01 in c and 0-1 in >. This note presents in Table 1 X s A P3 
apes = 5 tly published moment constants for Б 
t ingle sample case (m = 1) based on recently p 1 › л е 
+ ЖҰМЫ. Wes ы samples (Hartley & Pearson, 1951) and an interpolation of the P-funetio 
ribu 
se intervals. - . - 
"E Кыллы that if these corrected values of с and > are employed, the approximation to E e A 
tribution of range by cy/Av is somewhat better than was indicated by Patnaik (1950). T hus result: de 
Lord (1947), obtained exactly by quadrature, provide factors by which the range (w) in a single арт: = 
>» . А 
of n should be multiplied to obtain confidence limits for the mean (д) of the normal population. 
which the sample has been drawn. For example, 95 % confidence limits can be obtained from 


V Јоду, 
~ ы imit 
where 2 is the sample mean and је; is given in the last column of Table 1 below. If toos is the 5 % limi 
of Student’s ratio (two-sided test) for у degrees of freedom, then Joos may 05 БЕ: 
Comparison of the last two columns of the table Shows that the agreoment is excellent. The differences 


В а 2 
are negligible for all practical purposes. The agreement is not unexpected in view of Pearson 8 Du. 
findings that the Patnaik y method provides an excellent approximation to tho probability integra 
the range for n = 4, 6, 10, 15. 


be approximated by f9.95/(¢ in). 


Table 1. Scale factors and equivalent degrees of freedom for y-approximation 
to range in normal samples 


r 
imits* 
No.in isses of Soale Equivalent Confidence limits 
sample freedom factor two-sided 
с 0, 
ш ^ 5%: + t[(c n) Lordt 
2 1:0000 1-41421, 12-7062 6-3531 6:3531 
3 1-9845, 1-91154, 4.3349 1.3093 1.3039 
4 2.9291, 2.23886, 3-2265 0-7206 0:7166 
5 3:8266; 2.48124, 2.8967 0-5095 0:5066 
6 4-0772, 2.67252, 2.6249 0-4010 0-399 
T 5:4841, 2-82980, 2-5038 0:3344. 0:333 
8 6:2511, 2-96288, 2-4233 0-2892 0-288 
9 6:9818, 3:07793, 2:3658 0:2569 0:255 
10 7-6798,_ 317905, 2-3228 0:2311 0:2301 


* Multipliers for sample range to get 95% confidence limits of the mean. 
T From direct quadrature (Lord, 1947). 
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* Compare the values of с and » given below with those in Patnaik’s Table 1 for m= 1. 
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The third moment of Gini’s mean difference 
By A. R. KAMAT, University College, London 


U.S. Nair (1936) has given an expression for the variance of Gini’s mean difference, defined by 
2 т & 

w= а 921. a 
Nair's formula applies to any parental distribution of tho a;, although its actual evaluation may be 
tedious. Recently Lomnicki (1952) has obtained a simpler expression for the same by a more direct 
approach. In this note we dorive the third moment of g when the parental distribution is normal with 
the help of the absolute moments of normal distribution obtained by Nabeya (1951, 1952) and Kamat 
(1953). It may also be observed that this method of derivation is considerably simpler than those adopted 
by Nair or Lomnicki for deriving the variance of g. 

Геба; (i = 1,2,..., п) denoteasample from anormal population, N (4, с). Then ifwe denotez,; = 2,—2,, 
Zi; is №0, /20) and p(z;,2;,) = +. For the evaluation of the first three moments of g, we require the 
following expectations which are readily found by using the formulae for absolute moments (seo, for 
example, Kamat, 1953, pp. 26-7): 


2 Р 2 У а“ һ 
80125) = Те 6 (25) = 20%, Elza l zil) = z; CVF) 


80° : 2 503 
82 |") = Jr’ 6 (ги [язь |) = = 
803 ( 1 
Ell zal | га | га) = Sp (бонат » 4 
? ы з (2) 
с 
&(|2 112112) = “Ж. 
Воз (1 r d 
é(25| Па) = ә s —}sin1}+sin EL 
803 (J3 т 
“ЖЕ? Е] [2 |) = Ey zt А 
To find tho moments of g tho following expectations have to be evaluated: 
2 
«(ду = р S. 
(g) TE jee! ul 
Pts : PEDI [2 |+ 12011800 m 


Ж n*(n — 1) 


" sz 
£g?) = apap f leh + Eeh |n 2 |a D 071 [za | Л +E | zis | | ИЕ 
4 |2 || = Пан] | zix | 12 | + | zis е | | 222 | 


Where 4, j, k, l, p, g ате all different and the summation symbols denote summation over all of them with 
ө definition (1). 


the і imitati i sed by thi б Б се 
я н vaio de of terms in the sums of the various types occurring in (3) are as 


о 
lows; Type of sum 


ЕЗ 


Number of terms 
4n(n—1)(n—2); 


КИЕН 4n(n— 1) (n— 2) (n — 3), 
| —1)(%—2), 
ә) | zix | n(n—1) (п 
= = =3), 
zi E dn(n— 1) (в 2) (n—3) = 


4n(n—1) (n— 2)(п- 3), 

4n(n—1) (n 2), 

4n(n—1) (n—2) (n— 3), 

рут 1) (n—2) (n — 3)(n— 4), 

(n— 1) (n—2) (n—3) (n — 4) (п- 5). 


[а | | + | lal 
ИЕЛЕ 
РЕ ДЕ 
ЕЕ 


1 
ПЕЛЕ 287 
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в given i n i ters simplification, 
Substituting in (3) the values of expectations given in (2) and noting (4), after some simplifica 

u 2 f 

we have the following first three moments of g: 


20 (5) 
= — = (1-128379) с, 
Ах ул ( 19) 
4g? 


fin = пи ул (2V8— 4+ lr) n+ (0— 443 + 4л) 


2 


= (0-651006) n+0-151508} (6) 
n(n—1) 
0-651006 


т 


9? for large m. 

so a —136— 80/2 142-120 — 1444/3 — 271)) 
Hs = те (1S V2 + 40— 36/3) n* + (1443 — 136 80 \/2) п + (96./2 + 120 у 

3 


= adn ps (0393063) n? + (0-399735) n + 0-094822) (7) 
n*(n—1)* 
(0-393063) сз : 

Su for large n. 


n 


Fora = 3 we know that g = $w, where w is the range. 


up with the third moment of w given, for instance, by 

Although it is not possible to say much about the distribution of оп the strength of its first threo 
moments only, the following values of V(H2)//. and f suggest that it may be close to the N-distribution. 
This is confirmed by the y-approximation to the distribution of tho range w for п = 3, since in this case 


9 = 3%. The fourth column of the table gives for comparison the values of f, for the y-distribution 
having the same (ие). 


It is casily verified that Hlg) given above checks 
Hartley & Pearson (1951). 


" Й Й, for 
Миз) A, X-distribution 
5 0-3657 0:1797 0.1672 
10 0-2411 0-0708 0-0648 
15 0-1926 0-0436 0-0398 
20 0-1650 0-0315 0:0286 
REFERENCES 


ARSON, E. S. (1951). р; i 5 
ав him ee Э ( ). Biometry ika, 38, 463. 


In some theoretical work, particularly in 
of certain Statistics are во complicated. at le. 
experiments in the laboratory to derive some 
of systematic sampling, which is often 
the method of random Samples, In suc} 
entire population is needed, the rando; 
that the order properties of samples 


a cluster of points. 
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One disadvantage lies in the greater difficulty of determining the sampling errors involved. When an 
idea of the sampling error is not available, the method can be safely used only when there is some external 
check on the resulting estimates. It has not been possible to evaluate the sampling errors of the method 
proposed in this paper, but some check is attempted by using some known theoretical results. 


2. SYSTEMATIC SAMPLING FROM A BIVARIATE POPULATION 


We consider first the problem of finding the probabilities of the 120 possible rankings of five from a 
bivariate normal population. In another paper (Sundrum, 1953, submitted to Biometrika) these pro- 
babilities are used to determine the third and fourth moments of Kendall's rank correlation coefficient 
in the non-null case. As it appeared extremely difficult to evaluate these probabilities theoretically, it 
Was decided to use estimates based on sampling experiments. 

The usual method based on random samples of constructing samples of five from a correlated normal 
population is to take independent sets, say of five Хх and five Y's, from Wold's Tables of Random 
Normal Deviates, and then for each set to find the quantity Z = pX + /(1— p?) У; then the resulting sets 
of X and Z may be considered a sample from a bivariate normal population with correlation p. The 
extent to which the sets of X and Z represent samples from a correlated normal population depends on 
the extent to which the original sets of X and Y represent samples from an independent normal 
population. 

Now, if samples of five bivariate observations of А5 and Y's are drawn from an independent normal 
population, and the X's are arranged in order of magnitude, then it is well known that the 120 possible 
rankings of the Ys are equally probable. In order to get better estimates for the correlated population, 
We therefore adopt a method which ensures that the samples of correlated observations are constructed 
from samples of independent observations, when these independent observations are made to conform 

| exactly with the condition of equi-probability of the 120 rankings. (In the following a set of Х'в and Уз 
will be referred to only by the ranking of the У’; when the X’s are arranged in order of magnitude.) 
The method used is as follows: five X’s and five Z's are drawn from Wold’s Tables. The five Z's are then 
permuted in the 5! = 120 possible ways. Each of these sets associated with the set of five А7 represents 
а sample from an independent normal population. If, then, we calculate for each of the 120 bivariate 
Observations the quantities У = X +Z (where Z' represents one of the 120 permutations), we get 
120 sets of five Y's each, which when associated with the X’s may be considered as samples from 
à bivariate normal population with correlation p = 1//2. 
To illustrate the method, consider an example with three bivariate observations. From the first six 


numbers of Wold’s Tables, write & =68 137 —0-18 
Z 035 282 212 
The 3! = 6 samples from an independent population are then 
х 2 2, 2; 2 25 2, 


— 0:8 0:35 0:35 2:12 2:12 2:82 
0% 2:12 2:82 0:35 2:82 0:35 2:12 

1:37 2:82 2:12 2:82 0:35 2:12 
Then the samples from a correlated normal population are 
Y, = X +Z, thus: 


obtained by calculating the values of 


X Y, Ys Y; Y, Y, Y, 

— 0:34 -049 0-49 1-28 1-28 1-98 1-98 
"ic 1:04 964 O17 264 — 017 1-94 
137 419 3-49 419 1-72 3-49 1-72 


samples by taking the permutations of the X’s. Thus, with 
can generate 240 sots of five bivariate observations each, 


Tn the same manner, we can also obtain 5 
& samplo of five bivariate observations, we 


"ma lation. 
representi les from a correlated normal popu 
ih Pris a v ~ пе procedure may be adopted. For the above example, we construct a 3 x 3 
ico, a systema 


able as follows: aS 


| | 
<“ Ж —0-84 | -o1s | 
Zi ^ | | 
- = | 
.35 — 0-49 | 0-17 | 1-12 
$12 1-28 | 194 | ie | 
-98 2-64 у 
2.82 1:9 | | | 
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i ; e.g. in the 
i d sof X ^ 5 ling to each cell; e.g. in 
d in by adding the values of X and Z corresponding eee 
The body of the in gai сы put а — 0:84 4+ 0:35 = — 0-49. We then get a set of seein ai 
first cell on ye г taking elements from the first, second and third columns, such that no x = ШЕ 
унн Si r ilarl у, a set of three values, correlated with Y, is obtained by taking сна iB dorm 
i о. on ний third rows such that no two are in the same column. This can be done fair! уч: 3 
the first, seco: 5 у 
i i ay even be mechanized. | | : cn 
E m pe obtain samples from a bivariate normal population with correlation р F ИН = 
ivi i $ ive Z's wer < қ 3 of Wold’s T'ables; each o 5 
1 i 5 Y зге taken from p. 33 of Wold’s T'ables; 1 
і ivariate observations of five X's and five 25 were кышына or 
E nd to generate 240 observations from the correlated population, giving in all 12,000 d ee 
serva tions When these 12,000 observations are classified, according to the ranking of the Ys, we o 
the following distribution: 


ы к b 51234 1 
12345 1,044 21345 631 31245 299 41235 123 а LE 
12354 554 21354 274 31254 124 41253 42 51549 2 
12435 719 21435 404 31425 161 41325 — 80 I + 
12453 282 21453 154 31452 57 41352 29 51 Ш 
19534 339 21534 132 31524 46 41523 39 51423 
12543 942 21543 121 31542 32 41532 20 51432 К: 
13945 631 93145 301 32145 233 42135 92 52134 г. 
13254 381 23154 133 32154 124 42153 38 52143 B 
13425 345 23415 131 32415 110 42315 07 52314 1 
13452 199 23451 96 32451 29 42351 10 52341 x. 
13524 183 23514 53 32514 62 42513 9 52413 11 
13542 95 23541 21 32541 24 42531 11 52431 5 
14235 416 24135 158 34125 41 43125 63 53124 16 
14953 146 24153 38 34152 14 43152 18 53142 5 
14325 994 24315 113 34215 59 43215 65 53214 20 
14352 95 24351 97 34251 13 43251 15 58241 7 
14523 85 24513 16 34512 9 43512 $8 53412 7 
14532 54 24531 21 34521 6 43521 10 53421 4 
15234 179 25134 42 35194 2] 45193 19 54123 14 
15243 107 25143 97 35142 13 45132 9 54132 5 
15324 190 25314 29 35214 29 45213 5 54213 6 
15342 5g 25341 12 35241 12 45931 5 54231 3 
15423 62 25413 9 35412 9 45312 6 54312 1 
15432 39 25431 14 354291 4 45321 5 54321 2 


Total 12,000 


© can draw five sets of four bivariate observations: 


example, from the 
me and re-ranking tho remaining 
ted by (4312), (1423), (1423), (1432) and (1432). Thus, we сай 


sample represented by (15423), by omitti 
observations, we get the samples represen 


Observations with four bivariate points 
each. The distribution of these is as follows: 

1234 12,891 2134 6740 3124 2733 4123 847 

1243 6,588 2143 3153 8144 944 4132 511 

1324 17,571 2314 2788 3214 2199 4213 586 

1342 2,825 2341 748 3241 613 4231 292 

1423 3,207 2413 1017 3412 395 4312 3922 

1432 1,979 2431 570 3421 313 4321 238 


We can continue the same method to obtain the di: 


Stribution of ranki f three observations fro"? 
the abovo distribution of rankings of four, Wo then get: nkings of three obse 


123 106,510 


132 48,096 
213 47,946 
231 13,899 
312 14,362 
321 9,194 


Total 240,000 
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At this stage we can check tho ‘quality’ of this sample by the correspondence of the above distribution 
with some theoretical values. I have shown elsewhere that the probabilities of the six rankings of three 
from a normally correlated population are given by 


3 (4 


Xx 4. 4 
Pr{123 = = 5 + = T p— саш ір- zi (sin р)? — 5 (віп”і вр] 


8 


Pr {132} = Pr {213} -i 


2 1 assi re ЕТ 
-— = 81 -sin-!1lp— 
$5 B px-sin" 3p 


i 


п 


5 Gin"! р)* + E (sin! er] 


өш ү. $ ПЕ” й. 
r{231} = Рг 2} = -{=-+- 5ш —-sin-! 4p — — (sin р) + (sn™ 3p)" 
Pr(231) = Pr(312) = tlg t; sd T јат pras Jr] 


3[4 4 4 4 4 
„199 = =! == вџу -sin-!l = (sin-1 p)? — — (sin-! 4p)? 
Pr(321) = sls zin pt =. ір- 681 р) Ei ip) | 


Evaluating theso expressions for the case of p 
of these rankings actually achieved in the sample. 


= 1//2 we can compare with the relative frequencies 
The results are as follows: 


Theoretical Sample 
123 0-4431 0-4438 
132 0-2009 0:2004 
213 0.2009 0:1998 
231 0:0585 0:0579 
312 0-0585 0:0598 
321 0-0381 0:0383 


Tho correspondence appears to be satisfactory. 
Incidentally, from the distribution of rankings of 


Kendall’s rank correlation coefficient from this popu 


Distribution of t(n = 5) from a normally co: 


five, wo may derive the small sample distribution of 


lation. The distribution is as follows: 


rrelated population р = 1/42 


The mean and variance of this dis 
gures in brackets being the known 


trib 


ution are 0-4986 (0-50) and 0-108 


theoretical values. 


t Frequency 
1-0 1044 
0:8 2535 
0-6 3041 
0-4 2452 
0-2 1518 
0-0 798 

— 0:2 390 

– 0:4 149 

— 0:6 55 

— 0:8 16 

— 1:0 2 

Total 12,000 
SS See 


18 (0-10992) respectively, the 


456 Miscellanea 


In the same manner, we may derive the distribution of the Spearman rank correlation coefficient in 
e , tribution: of 
samples of five from such a population. The distribution is as follows: 


Distribution of r,(n — 5) from a normally correlated population with р = 1/42 


7, Frequency fs Frequency 
1-0 1044 = 0-1 235 
0-9 2535 —0:2 116 
0:8 1059 — 0:3 142 
0-7 1982 — 0:4 55 
0-6 1242 — 0-5 67 
0:5 893 — 0-6 55 
0-4 562 — 0:7 37 
0:3 896 — 0:8 18 
0-2 320 — 0:9 16 
0-1 505 — 1:0 2 
0-0 219 

Total 12,000 

ы 


The mean and variance of r, from this distribution are 0-5938 


and 0:12365 respectively; the theoretical 
value of the mean, from Moran’s formula, is 0-595, 


2 sont 
© X’s and Z’s do reproser 


tisfy the requirement that all the (1 ways of 


т | Possible ways. From each of these sets th? 


fn¥’s by addi { f thom: 
be used to 5 Roe У adding the quantity 4% each о 
О study the distribution of such statistics ne манга two-sample 
4. Concrusioy 
very much 5 


рк pe 
ticableto me because of the systematic nature of t 


In using this method, + 
sampling. However, i 
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A note on ordered least-squares estimation 
By F. DOWNTON, University of Liverpool 


Tho parameters jr. с of a distribution having the form f{(«—j)/o} may be estimated by applying the 
Method of least squares to ordered observations, where ordering is according to magnitude. The general 
theory was given in a paper by Lloyd (1952). It was there shown that the ‘ordered’ estimate 2 of the 
expectation had a variance never exceeding that of the sample mean; and, in the case of symmetric 
distributions, necessary and sufficient conditions were obtained for the variance of / to be strictly less 


than that of the sample mean. 

In this note these conditions have been extended to include the unsymmetrical case. 

Lot a, а, ...,2, be a sample of n independent observations on a continuous variate X whose dis- 
tribution has the above form. Tho possibility of repetitions among the 275 then has zero probability. We 
may therefore, with probability one, write 

прото X Tu) S +++ < Vin) (1) 
for the ordered observations at. 

Ted 2,-(а,-/0/0, ду = (0910/0. 
be the reduced observations, unordered and ordered respectively. They have parameter-free distributions, 
80 that their moments may be regarded as known. 

Clearly, 

бг) =0 (r21,2,3,...,n). (2) 

Lot (г) = Фе COV (2,2) = ws (7,8 = 1,2,8,..., п). 

Let а denote the (n x 1) vector of the а,, w the symmetric, positive-definite (n x n) matrix of the 
%,, Тап (n x 1) vector of I's, and x the (n x 1) vector of the zi. 

We note for future use that 1'& = 0, that а + 0, and that 1^w1 = n. The first of these results follows 
from tho fact that the z, and the 2 are simply permutations of the same set of numbers; hence 


Iz = Izi and 
а Va = Ха, = Хб) = (556) = 6(Х2) = 2602) = 0. 


A similar argument shows that wi =n. р | А 
The second result follows from (1). For with r<s, we have 2) < z, and, taking expectations, а, < а. 


"'hereforo а. + ot, and hence a = 0. . 
Tho ‘ordered’ estimate д, of и, and its variance, are given by 
Ё = ал“ (a1' — 1a’) w^x/A, 
var (fi) = a^w-!ac?/A, 
А = (1^w-1) (a wa) — (1w-la)?. 


Lloyd showed that when the distribution is symmetric, Гута = 0, €— mamang са аа 
these expressions. He also showed that а necessary and sufficient condition m the Бата ш, p 
Strictly smaller than 6?/n is that wll, or, equivalently, that var (2) = ain i m yi = zh Te 
Showed further that when the variance of Й equals that of the sample mean, / itself coincides wi e 
Sample mean. 

In tho gener a 
Positive-definite and symmetric an 


Whero 


"al case, where no symmetry assumptions are made we first note that both w апа w~! are 
өзен p d may therefore be expressed in the form 


w-tt and w= (tyt, 


Where t is a lower triangular matrix. 
Consider any two (n x 1) vectors b and c. 


Then b/wb = b'tt'b = ВВ = Eh say, 
Wher h= vb. , te — 2 
pcr. tb c^w-lc = СЕЛУ tc = КЕ = Eki say, 


Where k — і-іс. А E i 
Now by the Cauchy-Schwarz inequality we h 
(512) (Z3) > (Eh, kj)". 


ave 


(3) 


458 Miscellanea 
and the necessary and sufficient condition for equality is that А 
h;= Ak; for some constant A and for all i. 
i 3 d (4) become 5) 
,..- (b^wb) (c^w-!c) > (b’c)?, ( 
Е 6) 
with b= Awe as the condition for equality. ( 


Put b = w-11—1 and с = а. Then (5) becomes 
(1^w-11 — 21/14 1^w1) (awla) > (I/we — 1'ш)?. 
Now ll-l'wi-n and Va=0, 


Thus (WH — п) (алуа) > (дула), 


or КО) (ау а) — (Тула) (a ^w ла) = o*/var (д) > n, 

^ Ж :tion for 
since a^w-! is essentially positive. Thus var (4) <0* т; and tho necessary and sufficiont condition f 
equality is, from (6), that 


of the sample mean, but Ё itself necessarily is the sample mean, 
For, pre-multiplying (7) by a^w-!, 1^w-i 


9^w-1 — Ao^w-lg = o'l = 0, j 
€ 
Uw — МА а = 11 => (9) 
and УИ А-а = 1, 09 
respectively, 
Now А = (a’w-lq) (1^w-11) — (1^w-!g)? 
= awe СОСЕ from (8) 
= na^w-la from (9). R^ 
Also 


йт («^зу-161/_ ew 


Ча) зу-1х/д 
= a’ wa(1^w-a. 


Aa’ wl) худ from (8) 
= @/wolel’x/A from (10), 
whence Д = 1’x/n, which is tho sample mean, 


Тат grateful to Dr E, Н. Lloyd for an extremely stimulating correspondenco on this subject. 


REFERENCE 
Lrovp, Е.Н. (1952). т, 


east squares estimation f locati А ‚ statistics 
Biometrika, 39, 88. oF focation and scale parameters using order stai 


y i$ 
Moran (1948) and Kendall (1941)- dois 
Ө Problem and to summarize the po! 
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2. Li evi isti 
et the event Е; consist in the random variable x; being positive. Thus 
Р{2;>0 = Р(Е) = + 


For two vari. 
wo variables x, and га we have the well-known result of Sheppard (1898) 


(2 0,2550) = PUES} = ЗЕ d А 
- п 


"es This result is simply hieved by an а] 1 T 
PARA ply achieved by an ap eal t ; 
1566 козе for three variables. With three variables the probability ellipsoid ia be huic ша! 
to show for ihr » ithe original co-ordinate planes becoming three planes through the origin. It is easy 
platiesis бей sa ӨР тоге) dimensions that the angle between the ith and jth transformed co-ordinate 
Probably м = T Pir where р; 15 the correlation between the original variables x; and жу. The required 
а е bo У ог three (ог more) variables is just the solid angle in three or more dimensions when the 

;»etween the planes containing the solid angle are given. An exact expression for this solid angle 


appears 
pears to be known for three dimensions only. 


where ЈА а 
Різ is the correlation between гү and ta. 


tion for the required probabilities is possible. This relation has its 


analog Е 
Дес in many branches of mathematics: wo may quote, for example, any text-book on multi- 
sional geometry. It was also partially explored іп relation to an n-dimensional integral by 


Schaefli я 
паей! (1857). The theorem usually ascribed to Boole is as follows: 


3. A curi 
A curious recurrence rela 


> P(E,E,Ej) — (7 19 P(E, В... Ез). 


ЕЕ}+ 
i<j<l 


PU, EA. + By} = БРЕД- ХР 
i i<j 


Now for tho problem considered 

P(E, + Est- +8} = 1 2р, By... Ё„) = 1 — P(E,E, ... En} 
variato normal distribution. Henco for n odd wo have 
> P(E,;Ej) — .. ее. 


i<j 


b 
Y the symmetry of the multi 
P(By By... By} = y1-XPU 
i 


thi 
Е Нана term оп the right-hand side being the sum of the joint probabilities of the E's taken n—1 at 
©. Tho result for n = 3 is immediate: 
1 3 3 соз іе сов pis CO8 Pes 
PUB Hala} = "Кыл жщ эл 


1 =! 
=— [27 — соз- руа— 6087" Pa 008 1 Das]. 


4m 


For 
even wo got an identity. 
for п = 4, but it does 


deduce the results for n = 5 given that 
which will enable us to make the step from 3 to 4. 


~ Seem possible to obtain a recurrence relation > 
tan, following Kendall, gave an expansion for РЕ, E,E,E,), but experience has shown that this 
i urposes except for pi; small. For example, for pi; in the 

in reasonable accuracy. Some 


пој 

, Shbourhood of 0-5 at least twenty 5° necessary to obtain re 
ТК has been done towards obtaining 8 series expansion for Ри nearly 0-5 in terms of (p; — 0:5), but 
қ У in certain restricted савез. What is requir kly convergent series for pi; large and n even. 
di Om n = 4 wo deduce n = 5, calculate n = 6 es, deduce n = 7 and so on. The problem is 


iffi 
Cult but not unrewarding. 


4. а 
From the recurrence relation we can 
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i ty- or leptokurtosis (i.e. the excess kurtosis is negative 
sii kee A ees on Cope Eee Santa + 3). If p = (2+ 4/3) there is по excess kurtosis for 
рл) ‘ining 3 iae large, y, and у, both decrease towards zero, and with any finite separation 

Қу qs netu een tends de infinity (with infinite separation, Уз diverges to infinity). W ith zero 
eee и ae e d Ул and у» are zero for all values of р. The curves of constant separation are 
pak зілі: Ada d and with large values of p, near the origin, closely approach the straight lines 
У sad "The curves of constant ratio p are quartic parabolas, УУ = p p — 1)! (p? — 4p + 1)? (constant). 

g= Я 


4. USE OF THE DIAGRAM 
To estimate p and à bythemethod of moments when we have available a, sample of n individuals from the 
composite population, we obtain in the usual way from the k-statistics estimates g, and 9 of y, and У. 
These may be used as co-ordinates in the chart to obtain approximate values r and d for p and à, by 
interpolating between the contours. An estimate, s, of с may then be obtained from 
8 = (r+ 1) (+ 1)? + га, (3) 
while the estimates m, and ту Of из and jt aro given by 


m, = k, — ds|[(r-- 1), т» = E, + rds[(r + 1). 


(4) 
Here k, and k, aro tho first two k 


-statisties of the composite sample, 


5. PRACTICAL USE ОР THE METHOD 
We take as a first example the data used b: 


y Rao (1948) for the Same purpose, viz, tho heights in centi- 
metres of 454 plants, shown in Table 1. 


Table 1. Heights of 454 plants in centimetres 
Central value 8 9 10 11 12 13 14 15 16 17 18 19 20 Total 
Frequency 3 9 21 40 59 76 79 69 46 30 


13 7 29 454 
The k-statisties (calculated by Rao) are: 


ky = —0-244 (about 14 as origin), k, = 0-729, 
Га = 4-976, k, = — 5:315. 
Hence Ф = 0-066, g,— — 0-215, 


From equation (3), 8 = 1:836. 
From equations (4), 


та = — 0244 — 9.57/9.4 = _ 1:31, 
Ma = — 0-2444 1-4 x 2-57/9-4 = 1:26. 
The chief source of error in theso estimates lies in the difficulty 
diagram, Assuming a standard erro 


of accurate 
T of + 0:05 in the estimation of bo: 
the results due to errors of interpol 


estimation from the 


th ке f 
ation may be found fre r and d, the standard errors o 
оз, = c(l +p), [p = т/(т+ 1), 
о? 0803 
eo По apto pon 
Om za (210 41 бер ба ог Á 
т, 4 (р + 15 P + 1) + 8°] боз + 4p + 1:23), 
- 6% 
m = ак рт) р + D+ pa} 802 аро р 1:25), 
The results of these calculations, + 


and of maximum likeli 


ogethi itht i 
ood ant rig er with the theoretical Tesults of Rao by the methods of moments 
are given in Table 2. 


19 standard errors of the methods due to sampling and assumptions; 
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Table 2 


|, Estimated value by method of Standard Standard 
Parameter | error of error of 

estimated Estimate ^ —— estimation | max. 

Ма: | | њу from diagram | likelihood 

Max. — 1 Ys с.=о,= 0:05 xstimaltes 

likelihood | Moments diagram (r= 04 ) estimates 

| 
— "— | ыы аё ли 
piQ d p) Di | 0583 | 0-58 0-009 0-161 
Ln ту — 1-341 —1:31 0-034 0-049 
Its ту 1-286 | 1:26 0-043 0-033 
с | 8 1-816 1-84 0-021 0-049 
| | 


The reading errors are rather large, but could be reduced if necessary by constructing a large number 
of contours on as large a scalo as was convenient. Also, in some cases the errors would be greater owing 
to the shape of the diagram, but p and 6 are usually small (p <5, 1 < | ô| <3) and can then be estimated 
with considerable accuracy. 

А second example may be taken from the distributions of length of trypanosome strains analysed іп 
some detail by К. Pearson (1914). The second column of Table 3 shows the observed frequency distribu- 
tion of the length (in microns) of 1000 individuals of Trypanosoma gambiense as recorded by Pearson. 
The moments and moment ratios of the distribution, using our present notation, are given by Pearson 


(p. 131)% as 
kı = 22-1130, g, = 0-5361, 


ka 


11 


14.3389, 0% = — 0:4156. 
entering our chart with these values of g, and gs, we read off the estimates of p and ô as 


т= 8.70, d=3-15. 


Hence we find 
та = 27-175, 


s = 2-202, m, = 20-238, 


ру = У(1+7) = 0-730, pp =1-р = 0-270. 


Tho oxpected frequencies, based on the two normal components with a common standard deviation 
estimated by s, are shown in the third column of Table 3 and the differences (observed — theory) in the 
fourth. After combining four groups in each tail, it is found that y? = 15-28, with 13 degrees of freedom. 
'The probability of exceeding this value if our theoretical structure were correct is 0-29. The fit is quite 
good, but it is evident that there are certain systematic differences between theory and observation, 

K. Pearson (1914, p. 131) did not assume a common standard deviation for the two components and 
used his general moment method of fitting (1894) involving the solution of a nonic. He found 


т. = 26-2463, S, = 2-6260; 
Po = 0-3495. 


s, = 2:0566; 


p, = 0:6505, 


та = 19-8926, 


Using these estimates of the parameters, the theoretical distribution given in the fifth column of Table 3 
is obtained, with tho differences (observed — theory) of the sixth column. In this case ү? = 10:77 with 
12 degrees of freedom, a value exceeded by chance with a probability of 0:55. The fit has been improved 
by allowing the two variances to differ, but there is still evidence of some systematic discrepancy, as 
shown by the run of the signs of differences. 


6. SuMMARY 


This graphical method of analysis into two components by means of the skewness-kurtosis diagram may 
be useful in many cases where a simple, rapid determination of the probable nature of the two sub- 
universes is wanted. It is less efficient and less accurate than existing theoretical methods, but thero 
àre always unavoidable errors due to sampling and necessary assumptions about the components. It is 


* Pearson gives values of moments, with n as divisor, not k-statistics, but with » — 1000 the slight 


Adjustment called for seemed unnecessary. 
30-2 
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i i lue, i rast to these methods, in producing quick approximations 
ен кү. бон те а pe bind. of distribution may be асый by this method into two 
ә e irrespective of whether its character is, or is not, due to their physical exist prc; 
WU erm pee aad the diagram lie within the bounding parabola; hence the method cannot offer 
ens шы of their physical existence. It may, however, permit us to devise physical and experi- 
ats tests to identify their physical nature if their existence seems probable. 


Table 3 | 
Dserved. Distribution Pearson's | 
Central Observec fitted by Difference composite Difference 
троа frequency diagram distribution 

12 -- 0-2 | 

Е 5 
18 thio eerie! тә 92106 | —0-6 
15 9 8-1 4 | 
16 21 21-2 -0-2 21:6 — 0:6 
17 56 45:3 10-7 47:4 8:6 
18 79 78-9 0-1 82-9 —3-9 
19 114 112-3 17 115-1 —1-1 
20 122 130-5 —8:5 128-0 — 6:0 
21 110 124-6 - 14-6 115-8 - 5:8 
22 85 98-9 — 13-9 89-2 —42 
23 85 68-8 16-2 65-6 19-4 
24 61 48-7 12.3 54-5 6-5 
25 47 43-3 3-7 53-3 — 6:3 
26 49 46-7 23 54-3 — 5:3 
27 47 49.7 -27 51-0 – 4:0 
28 44 45-6 —1-6 42.4 1-6 
29 31 348 —3-8 30-7 0:3 
30 20 21-6 — 1:6 19-2 0-8 
31 11 11-0 0-0 10-5 0-5 
“ТЕЛЕ * 
34 —[8 0-4 [67 13 2079 0-1 
35 — 0-1 0-1 

МА Total 1000 1000 1000 


'iginally began investigation 0 д 
aboratories, Butler, fO! 
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A note on regions for tests of kurtosis 


By G. E. Р. BOX 
Imperial Chemical Industries, Dyestuffs Division Headquarters, Blackley, Manchester 9 


Тһе most common test criterion for kurtosis is ба, the sample fourth moment divided by the fourth 
power of the standard deviation; an alternative criterion proposed by Geary (1935) and denoted by 
а is the ratio of the sample mean deviation to the sample standard deviation. The present author has 
shown in a paper appearing on p. 318 of this volume that Bartlett's (1937) modification of the Neyman- 
Pearson L, criterion (1931) for testing for the equality of variances is so sensitive to kurtosis that when 
thero is only one observation per group (group means assumed known) this criterion denoted by.M , is 
of the same order of sensitivity to kurtosis as is bẹ or a. 


Illustration of symmetry conditions when N=3 


я ©) & 
| 
ШЕ: 


(iii) Rectangular (iv) Normal (у) Double exponential 


Distributions and probability contours showing directions of accumulation of probability density 


Fig. 1. Geometrical properties of tests of kurtosis. 


A better understanding of the relations which the various tests for kurtosis have to one another and 
to М, is gained by considering the general nature of critical regions for such tests. For simplicity we 
shall need to assume that the mean is known and equal to zero. We notice first that any criterion to 
detect kurtosis must be independent of scale; if we reach a certain conclusion from a sample ул, ys; +++ YN 
wo must reach the same conclusion from the sample Кул, Куе, +++ kyy- It follows that the boundary of 
the critical region in the sample space consists of one or more cones with apices at the origin. Also each 
Observation has equal weight so that if the point Yr Ya +++ YN is on the boundary of a critical region then 
50 must tho remaining N!—1 points obtained by permuting the order of the values. Finally, since 
а test of kurtosis should detect departures from normality other than those due to asymmetry of the 
distribution, the test criterion must be independent of the signs of the observations. Thus the pattern 
Of critical subregions in the region of the space in which the observations are all positive will be repeated 
in all the remaining 2N — 1 regions of the space generated by possible differences in sign. The conditions 
of symmetry thus imposed are illustrated for N = 3 in Fig. 1 (i) and (ii) which shows саве us 
Subsets of conessatisfying these conditions. When the null-hypothesis of normality is true, the proba! i ty 
density in the sample space is constant over tho surfaces of N-dimensional hyperspheres with => а 

© origin, Consequently if the hypercones of the critical region are such that they ele a ee 
& of the total surface of such hyperspheres the error of the first kind is controlled at the level æ irrespective 


tho value of the scale parameter. 30-3 
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The directions of the axes of cones forming such regions when N = 3 are shown by arrows in Fig. 
l (iii) and (v) respectively. In particular b and а have critical regions of these types and so has M ,. 
Tn each ease, the particular type which occurs depends on whether we are using the criterion to test for 
leptokurtosis or for platykurtosis. The regions differ only in the shapes of their cross-section and even 
these differences are limited to some extent by the symmetry conditions listed above. For the case 
N = 3, sections of cones corresponding to the approximate upper and lower 5 % levels for the test criteria 
by, а and Л, are shown in Fig. 2 (ii), (iii) and (iv). (In order to make clear that the region in Fig. 2 (ii) 
defined by by > 6,(%) has a non-circular section, it has been necessary slightly to exaggerate tho departure 
from circularity.) 

2. It is of some interest to consider kurtosis tests from the point of view of the Neyman-Pearson 
theory of best critical regions. Suppose that the distribution specified by the alternative hypothesis is 
à member of the family 


2p (1 ivl" 
ply) = | Г|- ехр–—|-— (—о0<у< +00, 0 <9 < 00,0 <р < со), (1) 
4 4 p 
and q is given. This is a symmetrical distribution with mean zero and 


s a pti, , РТО, " 
x тд ^" U(/g9)* 
When q = 2, y, is zero and the distribution is normal. When д> 2 the distribution is platykurtic and, in 
partieular, when q tends to infinity the distribution tends to the rectangular, for (1) tends to 


= 1/(2р) (у<р) 
ни! (3) 
=0 (у>р). 
When 4<2 tho distributions are loptokurtie in particular when 4 = 1 the distribution is the double 


exponential. 

The likelihood ratio test criterion (Neyman & Pearson, 1928) for testing the null hypothesis of 
normality (4 = 2) against the single alternative that the distribution is of the form of the above equation 
with q equal to some other specific value до is readily found to be proportional to 


Ag, = (та) ““/8, (4) 

where m, is the qth absolute moment, 
т. = X|y|*/N, and in particular s = ті. (5) 
Tho inequality Nag < A«(1— 9), (6) 


whero А(1—@) and A(a) refer respectively to the lower and upper significance points of the criterion, 
dofines à set of cones lying along tho co-ordinate axes when 4, « 2 and a set of cones lying along the 
equiangular lines when qo» 2. That every such set defines а best critical region for the appropriate value 
of ду can bo seen from tho fact that on every* hyperspherical shell (4) and (6) define the region, inde- 
pendent of scalo factors in the null distribution or in the alternative distribution, the boundary of which 
is givon by tho fundamental inequality p, > kp» of Neyman & Pearson (1933). This region on every such 
shell thus contains for its sizo the greatest possible concentration of probability density when the 
appropriate alternative hypothesis is true. | Р 

We see that, а <а(1 — а) defines the most powerful test when the alternative distribution is the double 
exponential, for which у; = 3, also 6, < b(1— а) defines the most powerful test when the alternative 


distribution is ply) = const. exp — (y/p)* for which у; = — 0:812. When the alternative distribution is 
rectangular (у; = — 1:2,4 = оо) the test is defined by с<с(1—@), where 
c=|yz|/s (7) 


and уу, is the observation largest in absolute magnitude. It should be noted that with this criterion we 
Should be testing whether the largest deviate in absolute magnitude were £oo small and the test would 
be based on tho lower tail area of the distribution of c. The same criterion, but using the upper tail area of 


* This is literally true if it is assumed as in (1) that 9 < оо. In the limiting case when q, = œ, if 
We suppose a particular rectangular distribution of semi-range p to be the alternative and if r is the 
radius of a hemispherical shell, then the region defined on the shell will contain an amount of probability 
density greater than any other region of similar size if p<r< JN p. For all other values of r it will 
Contain an amount of probability density equal to that of other regions of the same size. The test is 


thus still most powerful in this limiting case. 
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son (1935) as a test for a single outlier (see also Pearson & Chandra Sekar, 
15 к ој ^n ^ri criterion are shown in Fig. 2 (i). Tt will be noted chabonly c 
lower tail areas of с, 6, and a define most powerful tests for the alternatives considered. In F ів. 2 
regions are differentiated by drawing in the axes of symmetry only for those sections which correspo 
to most powerful tests. We should expect c and b, to be particularly good criteria, there 
alternative hypothesis was that the distribution showed marked platy’ 
testing for marked leptokurtosis a would be ех 
(his Table 7) which are shown again in t 


fore, when the 
kurtosis; on the other hand, in 
‘pected to be better. Pearson's (1935) sampling experiments 
able 4 of the article appearing on p. 318 of this journal 
support this suggestion (although with so few results no firm conclusions сап be drawn). For the гесі- 
angular distribution a larger number of significant results is obtained with b. 
exponential a larger number is obtained with a. Tt does 
than b, as soon as y, was greater than zero; tho chang 

Tn a study of possible tests of kurtosis of this type, 
the symmetrical Gram-Charlier, Geary (1947) reached 
test for infinitely large samples (and correspondingly sm 
that, over a wide range of values of q, 


2, Whilst for the double 
not of course follow that a would become better 
e-over might well be at somo other point. 

and assuming the alternative distribution to be 
the conclusion that b, was the most ‘officient 


all departures from normality). He also showed 


the power of the various possible tests would not be expected to be 
greatly different. The present research emphasizes that the choice of c 


riterion for samples of finite size 
must depend on the type of alternative hypothesis which is in mind. Th 


o similarity of the critical regions 
for the various criteria confirm Geary's second conclusion. 
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The frequency justification of sequential tests—addendum 


By G. A. BARNARD, Imperial College 


In my paper (1952) on the above 
performed without explicit justifi 


for this process is now appended. 
The omission of this p 


topic, at the end of $3 ( 


р р. 147) an inversi 
cation. This has caused di pad d 


] limit operations was 
ifficulty ; 


for some readers, and a justification 


branches of applied mathematies ience 
to fill in what would otherwise b 2 © use of experie d 
density is given, u nder consideration, and its surfac 


that the lamina 15 


ed that ‘J. effroys has по 
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ns in cases of * 


the choice of prior distributio D ignorance” 


the choice...’. 


We now give the proof referre 
We wish to show that if 


-laid down general 


; rinciples which determine 
- This should read we pco 


Which uniquely determine 
d to above, 


b 
(a) (1/log b, | id 

on я” (054,0) dejo = a for all0<a<b<oo 
(b) аз а->0 and bso a” 


" 


and 


then 0 = g’. (9, а, b) >“, 
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We notice first that &”(o,a,b) = &"(cc,a/c.b|c) for any positive с. For if we make the changes 
X —cX, a>aje, b>b/e, then с со, 8->св, sa, sb, and t remain unaltered, and so А.(а.5) remains 
unaltored. It follows that we can write &^(c, a,b) = a"(ca, ob). 

We now put ¢ = loge, l, = loga, 1, = logb, in the integral, and put a"(ca, cb) -/Д4--1,,4--1,). Then 
(a) becomes 


l 
ШАЛ) ЈЕ а а = a’ for all ly <ly, 
ћ 


while (b) gives f(x, у) +a as x — — оо and y >+ оо. The second condition implies that, given any positive 
€, we ean find an A such that 


| y) – а | € € whenever т< — А and y> А. 


Now in (a), put h4 = —nA, 1, = nA, with п> 1. Then we have 
—(п—1) A +(n—1) 4 nA 
a^ = аа | f(t—nA,t4- һаа | Ји паланајф+ [' f(t—nA,t+nA)adt 
-пА —(n—1).4 (n—1) 4 
= (12nA) (Af(t, —nA, t, - nA) + 3(n— АЙ —nA, ty nA) + Аб 4, ty +nA)}, 
where —nA<t<—(n-1)A, —(n—1)A«t,«(n—1)4 and (n—1)A «t4 «nA, 


by the mean-value theorem. Nowfisa probability, and is therefore bounded, so that as n > oo, the first 
and last terms within the braces remain bounded. Hence, by taking » lavge enough, we can make о” 
differ by less than € from f(t; —n4,t,-- n4). But 4, -пА < — А and t 4-n4 > А, so that f(t, п, t, 4-24) 
differs by less than € from а. Hence z and а” differ by less than 26. 
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REVIEWS 


The Design and Analysis of Experiments. Ву О. КЕМРТНОЕХЕ. New York: John 
/ Wiley and Sons; London: Chapman and Hall. 1952. Pp. xix+631. 76s. 


3 1 Я s of the theory and 
Until two or three years ago there were few connected and comprehensive accounts of thc theory 


practice of experimental design. There were the classical monographs by Fisher and Yates and a € 
journal literature on subsequent. developments. To the statistician outside the field of agricultura 
experimentation, the ingenuity shown in dev 


ising more and more refined and specialized designs was 
indeed impressive, but the resulting complexity became, it must be admitted, somewhat. bafiling. 2 
There was clearly a need for a book which would give a unified account of all the designs, — 
advantages and disadvantages and their relation to general experimental principles, and which майр 
connect all this with modern statistical theory. As is inevitable whena gap in the literature is disclosed, 
not one but several books were written to 61 it. Quite apart from those recent statistical text-books 
which have given prominence to experimental design (among which that by Bancroft and Anderson is 
a notable example) at least four books entirely devoted to this subject have appeared in the last four 
years. (It is regrettable, incidentally, that their titles are virtually identical.) The first, by Mann 
(Analysis and Design of Experiments), concerns itself strictly with the mathematics underlying the 
analysis of different designs. The second is the well-known volume by Cochran and Cox. The third, by 
Kempthorne, is the one under review and the fourth, by Quenouille (The Design and Analysis of 
Experiment), only appeared in the United Kingdom in early 1953. It is notable, though in no way 
surprising, that three of these books are written (or partly written) by statisticians who have worked 
at Rothamsted. 
As might be expected with two books from the sam 
by Kempthorne differ in approach, 


to relate the subject matter of this fiel 
problem of experimental inference. 


д nat Cochran and Cox offer по 
guidance on general principles of design or on the a erwise of different designs, for 
they are useful and clear on both; but their book is а m ri tal design, that by Kempthorne 
is an advanced text-book. It is the former which wi service to most practical ехрегі- 
mentalists; the latter which, by and large, will be of more interest to the theoretical statistician, 
whether as consultant or teacher, There is Something to be said for usir | қ 
Possibly because its approach is more difficult, K 
of exposition which d 


istinguishes Cochran and Cox’, қ п 
helps. Іп any case, this new book is not for the beginner, nor is it = ран isi s pe pem 
reference. It should be looked upon as a comprehensive and advanced te xt-bool. i mee ith the 
concentration and patience such a book deserves, 900 and studied with 

Its comprehensiveness is indeed remarkable and Professor 
i ly the enormous 


view as possible aaa 
of statistics and to the genera 


Kempthorne must be cong 
ding a 5 


chapters, 


ratulated for 


nine chapters dealing w 


n incomplete block 
a final one, chapter 29, on tre: 


the designs, together with e 
is a fine achievement. 


ith factorial 


designs, one on 
atments applied in Sequence 
Xplanations and numerical ex, 


oH 
Worth mentionin, r ct whic 
Б one general aspe 
Y Professor Kem ле impression, not 89 
is О 
red throughout the volume. The алау ay 
" H : ү 
66 as a modification to the general two-W 


m 


Reviews 471 


model; the mathematics for it are given at length, but no explanation is given of its purpose or use. 
This comes later, first in § 8-7 in the chapter on The Validity of Analyses of Randomized Experiments 
and then in $ 9-9 in the chapter on Randomized Blocks. (The index does not refer the reader to $ 8-7). 
A certain amount of rearrangement on this, as on some other topics, would clarify the presenta- 
tion. The very first opportunity should also be taken for improving the index and the cross-referencing. 
The present index is quite inadequate and there is virtually no cross-referencing in the text. 

The theory and practice of experimental design has grown so much out of agricultural experimenta- 
tion that it is inevitable that the discussion and illustrations in this book refer almost wholly to this 
field. At several points, Professor Kempthorne remarks critically on methods used in the social sciences. 
16 is of course true that experimentation in this field is very much more difficult than in agriculture. 
But it is not impossible nor are survey techniques as limited in their value as Professor Kempthorne 
suggests, in passing, in his first chapter. In the last year or two, attempts have been made to use 


experimental designs in investigations dealing with human populations and something has been learned 
about the difficulties and possibilities: 


. For the statistician working in the social sciences, one of the 
most interesting problems is, to what extent experimental techniques, such as those discussed іп this 
book, can be imported into his domain of study. For him, as for his statistical colleagues in other 
Sciences and fields of activity, Professor Kempthorne’s book is a most welcome addition to the 
literature. 


2. A. MOSER 
London School of Economics 


/ Associated Measurements. Ву М.Н. QuENoviLLE. Pp. x +242. Butterworth. 35s. 


The Design and Analysis of Experiment. By М. Н. QuENOUILLE. Pp. xiii 4- 356. 
iriffin. 36s. : 


To have produced these two books within a comparatively short time is a notable achievement. The 
books have much in common; they are advanced non-mathematical text-books on statistical methods 
excellently illustrated with examples, mostly biological, which are handled with great skill and with 


a clear sense of the practical purpose of the statistical analys 
carelessness of writing. 


s. The main fault is an occasional 


The discussion is most ly in terms of specific examples and, while the account of these is remarkably 
clear, the reader without а sound background of theory may find it difficult to apply the more advanced 
methods to cases slightly different from those considered by Mr Quenouille. For example, the method 
of least squares, which underlies much of both books, is, so far as I could see, never mentioned and is 
certainly never explained. It is true that the stage-by-stage fitting of groups of constants in a least- 
Squares model is often most expeditiously done by analysis of covariance, and that this technique is 
used repeatedly, But a statement in some of the more complicated cases of the form that is assumed for 
the underlying population would have given a unity which is at present lacking. An excellent point 


about both books is the сате taken to explain the assumptions necessary for the validity of the various 
methods, 


Ф 


Associated Measurements starts with a very good section on graphical analysis, including an account 
of non-parametric tests for association. The description here of tests of partial correlation is open to 
criticism. The author defines а coefficient of medial correlation between two variates т and y by 
diehotomizing the scatter diagram at the medians, finding the proportion, p, of observations in the 
Positive quadrant, and puttin 

q р 5 фи=4р—1. (1) 


For an infinite population ¢,, takes values plus and minus unity for perfect correlation, and zero for 
independence. Also when ж and y are independent, the distribution of фу does not depend on the form 
of the population, so that a non-parametrie test of independence may be obtained. For a bivariate 
normal population Фу is related to the product-moment correlation coefficient, Ры» by W. Е. 


Sheppard's formula пф 
Poy=sin ( =) % (2) 


Mr Quenouille next defines a partial medial correlation coefficient by 


Фау — Ф. Фа 
Фаз= PM (8) 
(1— 42.9.) 
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No derivation of this formula is given, and although it is used consistently, one suspects that it is 
о deriv: 
misprint for 
ашар фи. = батбай (4) 
oe (1— 93.) 01-02.) 


derived by analogy with product-moment correlations. 
results. To see this it is enough to consider some particular 
suppose that «=y+z, where у and z are independently di 
conditionally on 2, 2 is perfectly correlated with у. Ви 
partial 078 suggest that the correlation is unchanged, о | | 
The situation is more satisfactory if, conditionally on z, x is independent of y. For, in this case, Фаул 
and $2,.., while not necessarily zero, are always appreciably smaller than ó,,. However the rg mite 
of фу. and $4, . does not imply that p,,. is small. For example if p,, — 0-696, Р: py, = 0:891, the 


aking p,,, ete. near unity, Pzy.= тау be made 
nough has been said to make it clear that the 


Both (3) and (4) ean give quite misleading 
trivariate normal populations. For example, 
stributed in unit normal distributions. Then, 


uncritical use of partial 
Similar remarks may, incidentally, 
Prof. M. G. Kendall in his Rank Correla 


i 7, the expectation of 7 in samples from 
а normal universe, is given by the analogue of (2), 
T 
реңін (2). (8) 


When the underlying population is normal these difficulties may of course be avoided by trans- 
forming ф or т by (2) or (5) before calculating the partial correlation. There are a number of objections 
to this as a general procedure, 
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The Statistics of Bio-assay. Ву С. I. Briss. New York: Academic Press, Inc. 


Pp. iii+ 184. 83.50. 


Statistical Method in Biological Assay. Ву D. J. Етххву. London: Charles Griffin 
and Co. Ltd. Рр. хіх + 661. 68s. 
Probit Analysis. By D. J. Еіххку. Cambridge University Press. 2nd ed., pp. хіт + 318. 


35s. 


The statistical theory of bio-assay has been almost entirely developed during the past 20 years and 
has reached a reasonably stable state, so that the appearance of text-books by two of the foremost 
workers in this field is extremely welcome. The two general texts cover much the same ground, though 
Finney's book is by far the more detailed; Bliss's book originally appeared as a single chapter in 
Gyórgy's Vitamin Methods and is necessarily rather compressed. Both authors deal with parallel line 
assays with quantitative nnd quantal responses, slope-ratio assays, the use of covariance and the 
combination of poteney estimates; Bliss has an interesting section on control charts, while Finney 
includes several topies which do not arise in vitamin work and so fall outside the scope of the shorter 
book. 

Inevitably in so brief a compass, Bliss's book is little more than a collection of recipes—indeed he 
refers his readers to Finney for all mathematical details. Within this limitation it should prove very 
valuable: The needs of the practising biologist are always kept in mind, and the author is bold enough 
to suggest approximate rule-of-thumb techniques for important problems, rather than to decline to 
commit himself on the grounds that no exact solution is yet available. 

Finney's book is substantial to the point of being encyclopaedic, and some of the theory set out is 
well ahead of practice, at any rate in the assay field. The reader is supposed to be reasonably familiar 
with the basie statistical techniques, but apart from this, the suggested analytical methods are 
explained in full detail with an excellent set of worked examples. The underlying logic of bio-assay is 
diseussed at length, and a number of incomplete block designs are given which should prove of great 
practical use. Four chapters are devoted to quantal assays, and these are noteworthy for a very full 
discussion of the various alternatives to probit analysis that have been suggested from time to time. 
The book includes a 40 page Appendix of Tables, many of them new. 

The choice between these two books is not an easy one. Statisticians will probably prefer Finney. 
Biologists may like to keep Bliss's book on the bench (not least because of its handy size), but a study 
of Finney's book will be necessary for non-routine problems, and may well enable them to improve 
on their accepted techniques. 

A second edition of Finney's Probit Analysis is very welcome, the first edition having been unobtain- 
able for some time. Few changes have been made in the original material, but a substantial new chapter 
on Recent Developments has been added, and some of the tabular matter is new (the table of weighting 
coefficients now covers natural mortality up to 90 9 !). Alternatives to probit analysis are still rather 
inadequately treated (as the author himself points out), but this has been remedied by the very full 
discussion in the later book. The omission of one or two other topies, such as Tocher's analysis of 
grouped data and Finney's own work on graphieal methods and the adequacy of a single cyele of 
calculation, seems to be due to a long delay in publication—the preface to the new edition is dated 
December 1949. The standard of production leaves something to be desired; the new edition has been 
produced by an offset-litho process on rather thick paper, and the appearance of the page has suffered 
somewhat. It seems a pity that the price should still be almost double that of the original edition. 
Nevertheless, Probit Analysis remains an essential book for the many biologists and statisticians who 
work with sigmoid curves. 


M. J. R. HEALY 


Statistical Theory with Engineering Applications. By A. Harp. New York: 
Wiley; London: Chapman and Hall. 1952. Pp. xii-- 783. 72s. 

Statistical Tables and Formulas. By A. Harp. New York: Wiley; London: 
Chapman and Hall. 1952. Pp. 97. 20s. 


"This book has as its aim the provision, in simple and logical form, of those parts of statistical methods 
which are of use to the engineer in his daily work. Nevertheless, in its 800 pages it covers eonsiderably 
more ground, and that ground in greater detail, than might be expected from this simple aim. The result 
has been to produce a text-book which derives from first principles and illustrates with practical 
examples all the common statistical processes. 
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The book assumes no previous knowledge, and although advanced mathematical techniques are 
avoided the treatment and mode of development is mathematical in form. Nearly all the important 
results derived mathematically are subsequently illustrated practically. The _contents | include 
probability, graphs, the fundamental distributions, the analysis of variance, sampling investigations; 
the analysis of regressions and sequential analysis. It is noteworthy that the various distributions 
which are studied take up nearly 350 pages, which gives some idea of the thoroughness of the treatment. 
The many practical examples that are provided to illustrate the theory are drawn almost exclusively 
from engineering sources but they are put down in such a way that no detailed knowledge of engineering 
technique is required and the reader could easily substitute his own examples from his particular sphere 
of applications. The only criticism that could be levelled against the examples is that they are all 
chosen to illustrate just one point at a time, and one feels that, at any rate towards the end of the 
book, more complex illustrations should be used bringing in simultaneously a number of points from 
the theoretical development. As it is, few practical problems that the engineer is likely to meet with 

will be as simple as those given in the book. Also given in conjunction with each piece of theory are 

references to original memoirs and papers which describe practical illustrations, The reviewer feels that 
such references, instead of being seattered, would be better gathered together at the end of each 
chapter. One other minor point is that far too many equations are numbered, thereby giving them 
а certain prestige value. For example, in Chapter 18 no less than 157 equations receive numbers and 
few are referred to again nor are they likely to be quoted. 

The particular portion of the book that struck the reviewer 
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Statistical Decision Functions. By А. Warp. New York: John Wiley and Sons, 
Ine.; London: Chapman and Hall, Ltd. 1950. Pp. ix + 179. 44s. 


By the tragie death of Abraham Wald in December 1950 the world of mathematical statisties lost one 
of its most netive and ingenious minds. The present book, published just before his death, shows him 
to have been at the height of his powers. 

To explain what a decision function is, and to introduce concretely the leading ideas defined by 
Wald, we will consider the following simple example: 

An urn contains black balls and white balls, and the proportion p of black balls is unknown. А fixed 
number n of balls is drawn, at random with replacement, and r of them are observed to be black. 
We are required to make a guess, p’, at the value of p, and if we are wrong we lose an amount of money 
proportional to L=(p—p’)*. у 

Апу rule by means of which we arrive at our guess will be a decision function for this situation. 
Since n is fixed, the only quantity on which our guess can depend is r, but the value of r need not 
uniquely determine our guess; we may make p’ depend also on the toss of a coin or some other random 
event. Thus examples of decision functions d(r) in this situation are: 


(i) d(rr[n, (ii) 4(т)=}. 

(iii) d(r) =A(r/m) + (2). for some A=1—p, O<A<1, 
(iv) d(r)=r/n with probability 4 

у with probability $. 


In enses (i)-(iii) we take p’=d(r), while in case (iv), which is an example of a randomized decision 
function, we toss a coin, and if it gives heads we take p’=r/n, while if it gives tails we take p'— 3. 
If we could attach, to each decision function а, a definite monetary loss incurred by its use, we would 
find the problem of choosing between one decision function and another quite simple. We would simply 
choose that function which minimized the loss. But such a simple solution is impossible, for two reasons. 
For one thing, for a given value of p the value of r will vary randomly, and so the guess p’ will vary, 
and the loss L will vary along with р’; and for another thing, the distribution of p’ will in general 
depend on p, and, for a given p', the loss L depends on p. Wald disposes of the first difficulty by noting 
that, for given p, and given d, the distribution of p’, and hence that of L, is fixed, so that we may 
evaluate the mean value of L as a function of p and d. This function Wald calls the risk function, 


pip, d). 
In the four examples we have given the values of p are 


(1) рајт, (ii) +— pq. (iii) A*pg/n + jp — ра), (iv) $— Pa. — (1/n))/2. 


Wald then adopts the principle that it is reasonable to act so as to minimize the expected value of the 
loss, when this can be calculated. This implies that if we have two risk functions, p(p, d), Pp. dy), 
the first of which is, for a given value of p, less than the second, then, for this value of p, the first 
decision function is ‘better’ than the second, in the sense that it is reasonable to aet on the first 
decision function rather than the second. If the first decision function is sometimes better, and is 
never worse, than the second, Wald says the first is ‘uniformly better’ than the second. Taking our 
decision functions (i)-(iv), we see that (i) is best of all when p=0 or p= 1, since then its risk function 
is zero; while (ii) is best of all when p= 4; on the other hand, when m 3, (i) is worst of all, while when 
=0 or 1, (ii) is worst of all. If we from now on take A= ШЕ + уп), the risk function (iii) becomes 
+ an), which is independent of p and uniformly less than (iv). d (ii) is acria d better than 
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decision functions. Thus the problem of determining the class of all admissible decision functions is 
ral i се. 
Me үг port to formulate simple and general conditions which guarantee 
that exceptional Gases having the same effect as the ‘free observations’ of the previous paragraph 
are ruled out. He therefore introduces, instead of the class of all admissible decision functions, 
the notion of a ‘complete class’ of decision functions. A class of decision functions is said to be 
complete if for any decision function d not in the class there is a decision function d’ in the class 
which is uniformly better than d. If we confine our attention to decision functions belonging to 
a complete class we shall not lose anything thereby, since those we are ignoring are worse than some 
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general (apart from cases, for example, where a uniformly most powerful test exists). So some principle 
has to be adopted which will pick out from the multiplicity of Bayes solutions one which can in some 
sense or other be regarded as the ‘true’ solution. Bayes and Laplace, of course, did this by their famous 
‘equal distribution of ignorance’ postulate. Now Wald suggests, instead, the principle of ‘minimizing 
the maximum loss’. This means choosing that decision function for which the maximum of the risk 
function is least. In the example we have taken, this means using the decision rule (ii). For we have 
already seen that this rule is a Bayes rule, and hence admissible. This means that no other rule сап 
have a risk function which is never greater than the risk function for this rule. But since the risk 
function for rule (iii) is a constant, this means that any other risk function must have its maximum 
value greater than or equal to this constant. In other words, rule (iii) minimizes the maximum loss. 

Minimax rules have useful properties, perhaps the most useful being, as in our example, that the risk 
function for such rules is usually constant. But the principle of adopting the minimax rule cannot be 
regarded as any less arbitrary than Bayes's and Laplace’s principle of the equal distribution of the 
prior probability. Lf, in a long series of repetitions of the problem we have considered, in fact p has 
a distribution, concentrated near the ends of its range, then decision rule (i) will in fact be the best of 
those considered; and if in fact there is a concentration towards 2, then perhaps (iii), or even (ii), will 
be better than (i). The prior distribution really is a feature of the problem, and without knowing it we 
cannot obtain a unique solution, which has any absolute title to being called the ‘best’. 

From the point of view of statisticians mainly concerned with problems of inductive inference, 
perhaps the most useful aspect of Wald’s main result is, that in so far as every admissible rule is 
a Bayes rule, since the data enter into the posterior probability only via the likelihood function, this 
represents another way of seeing that the likelihood function * contains all the information in the data’, 
and hence that the problem of inference reduces to the problem of deseribing the likelihood function. 


А 6. А. BARNARD 


PUBLICATIONS OF U.S. DEPARTMENT OF COMMERCE, 
NATIONAL BUREAU OF STANDARDS 


(i) Table of arctangents of rational numbers. Applied Mathematics Series 11. 1951. 
Pp. 105. Price $1.50. 


This table gives aretangents for rational arguments m/n, i.c. it provides tan-! (m/n) to 12 decimals for 
all pairs of integers m, n with 0<т<п< 100. Also shown are cot-! (т/п) = фт — tan~! (т/п) (to 12 
decimals) and the sum of squares m? + 7. 

The main applications of the table are: 

(a) Tho conversion of cartesian co-ordinates n, m to polars ф = tan-! (т/п) and r= Vm? +n?) for 
which an auxiliary table of square roots would be required. 

(b) The evaluation of natural logarithms for complex arguments from 


log, (в + im) = $105, (т? +n?) + itan™! (m/n) 


for which an auxiliary table of natural logarithms would be required. 
(c) The evaluation of the T-function for complex arguments from the recurrence 


log, l'(z4- йу + 1) = $log, (23 + y?) + tan? (y/x) - log, T(x 4- iy). 
The latter property may make the table useful in statistical work in connection with moment generating 
functions of I'-variables or Laguerre series. ` 
Auxiliary tables provide formulae for the ‘reduction’ of arctangents in the forms 
* tan-!(m/n) = Ef;tan-!r; (Table 1, col. 3), 
tanim = Ef;tan-'m, (Table 2), 


by providing the integers fj, r; and m; « m. The table is provided with a short Introduction prepared by 
J. Todd and giving theoretical foundations and details of interpolation. 


(ii) Tables of the exponential function e*. Applied Mathematics Series 14 (supersedes 
MT 9). 1951. Pp. 537. Price $3.25. 


This table constitutes a revised edition of the second volume of the s of mathematical tables 
published by the Work Projects Administration of the City of New Yor аз originally published 
in 1939, It represents the most extensive table of the exponential function available if one takes into 
consideration both the decimal accuracy in e? and the fineness of the tabular interval in x. 
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It provides the following functions: 


Ld 
ег to 18 decimals for == 0-0000 (0-0001) 1-0000 
- to 15 decimals for 150000 (0-0001) 2-5000 
» to 15 decimals for ' 2-500 (0-001) 5-000 
` to 15 decimals for 5-00 (0-01) 10-00 ` 
e-* to 18 decimals for 0-0000 (0-0001) 2.5000 
i ех and e-? to 18 decimals for 0 (10-5) 10-4 Қ 
to 19 significant figures for 1 (1) 100 


and auxiliary tables. | | 4 

It should be noted that e” and e-* are given in separato t; 
argument, two separate pages have to be consulted. A comm 
preferable. E 


ables and, if both required for the same 
on argument for zx 2-5 would have been 


(iii) A guide to tables of the normal probability integral. 
Series 21. 1952. Рр. 16. Price 15с. 


This issue of the series provides, in Part I, a brief index of tables of the normal probability integral. In 
addition to the reference, information is provided on the typo of area tabulated (1.6. whether single 
tail, two tail, etc.) the decimal accuracy, the tabular intervals and auxiliary quantities (o.g. differences) 
provided for interpolation. Lay-out and arrangement of this index are thereforo similar to those adopted 
in An Index of Mathematical Tables by Fletcher, Miller & Rosenhead (Scientific Computing Servico Ltd., 
London and McGraw-Hill Book Co., New York, 1946), although the present list of specialized tables is 
more extensive. а ^ 

Part II gives the corresponding index for tables of t 

Part ILI gives principles of interpolation applicable t; 
of formulae. 
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e у ы У k b 
he inverse function (normal deviates). 
0 these tables in general and Part IV а summary. 
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Series 25 (supersedes A.M.S.I.). 1959. Pp. 60. Price 406, 
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Н. О. HARTLEY. 
_ Additional Note by Editor. 
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-B.S. Mathematical Tables Project. 
(v) Table of arctan x. Applied Mathematics Series 26. 1952, Pp. 188. Price 
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